This Page Is Inserted by IFW Operations 
and is not a part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of the 
original documents submitted by the applicant. 

Defects in the images may include (but are not limited to): 

• BLACK BORDERS 

• TEXT CUT OFF AT TOP, BOTTOM OR SIDES 

• FADED TEXT 

• ILLEGIBLE TEXT 

• SKEWED/SLANTED IMAGES 

• COLORED PHOTOS 

• BLACK OR VERY BLACK AND WHITE DARK PHOTOS 

• GRAY SCALE DOCUMENTS 

IMAGES ARE BEST AVAILABLE COPY. 



As rescanning documents will not correct images, 
please do not report the images to the 
Image Problems Mailbox. 



PCTi 



WORL 



.LECTUAL PROPERTY ORGANIZATION 
International Bureau 




INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 

C12N 15/63, 15/10, 15/81, C12Q 1>68 



A2 



(11) International Publication Number: 
(43) International Publication Date: 



WO 98/13502 

2 April 1998 (02.04.98) 



(21) International Application Number: 

(22) International Filing Date: 26 September 1997 (26.09.97) 



PCT/US97/ 17276 j (81) Designated States: AU, BR, CA, CN\ CZ. M, HU, IL, JP, MX 
NO. PL. RU, SK. European patent (AT, BE, CH DE DK 
ES, Fl. FR, GB, GR. IE, IT. LU, MC, NL, PT, SK). ' 



I (30) Priority Data: 

08/721730 



27 September 1996 (27.09.96) US 



(71) Applicant: ICOS CORPORATION [US/US J 2202! 20th 
Avenue. S.E.. Bothell, WA 98021 (US). 

(72) Inventors: GOODMAN, Richard, R; 18560 Westview Drive 
Lake Oswego. OR 97034-7382 (US). HOEKSTRA Merf 
F.; 10321 216th. S.E., Snohomish. WA 98290 (US). 

(74)A B ent: WILLIAMS, Joseph. A., jr.; Marshall, O'Toole 
Gerstem. Murray & Borun, 6300 Sears Tower. 233 S 
Wacker Drive, Chicago, IL 60606-6402 (US). 



Published 

Without international search report and to be republished 
upon receipt of thai report. 



(54) Title: METHOD TO IDENTIFY COMPOUNDS FOR DISRUPTING PROTEIN/PROTEIN INTERACTIONS 
(57) Abstract 

The present invention relates generally to materials 
and methods for identification of inhibitors of interactions 
between known binding partner proteins. 




Tet op L 



IS3 



] ► No Growth 




Tet op 



HIS 3 



►Growth 



SNSDOC:3: <WO 9813S02A2_I_> 



FOR THE PURPOSES OF INFORMATION ONLY 
Codes used ,o identify States party to the PCT on the from pages of pamphlets publishing international app.ications under the PCT. 



AL 


Albania 


ES 


AM 


Armenia 


Ft 


AT 


Austria 


FR 


AU 


Australia 


GA 


A/ 


Azerbaijan 


GB 


BA 


Bosnia aiKf Herzegovina 


GE 


BB 


Barbados 


GH 


BE 


Belgium 


GN 


BF 


Burkina Faso 


GR 


BG 


Bulgaria 


HU 


BJ 


Benin 


IK 


BR 


Brazil 


IL 


BY 


Belarus 


IS 


CA 


Canada 


IT 


CY 


Central African Republic 


JP 


CC 


Congo 


KE 


CH 


Switzerland 


KG 


CI 


Cote d* I voire 


KP 


CM 


Cameroon 


CN 


China 


KR 


cv 


Cuba 


KZ 


C7. 


Czech Repuh Ik- 


LC 


UK 


Germany 


U 


DK 


Denmark 


LK 


KK 


Estonia 





Spain 
Finland 
France 
Gabon 

United Kingdom 

Georgia 

Ghana 

Guinea 

Greece 

Hungary 

Ireland 

Israel 

Iceland 

Italy 

Japan 

Kenya 

Kyrgyzstan 

Democratic People's 

Republic of Korea 

Republic of Korea 

Kazakstan 

Saint Lucia 

Liechtenstein 

Sri Lanka 

Liberia 



LS Lesotho 

LT Lithuania 

LU Luxembourg 

LV Latvia 

MC Monaco 

MD Republic of Moldova 

MC Madagascar 

MK The former Yugoslav 

Republic of Macedonia 

MI, Mali 

MN Mongolia 

MR Mauritania 

MW Malawi 

MX Mexico 

NE Niger 

NL Netherlands 

NO Norway 

NZ New Zealand 

PI- Poland 

PT Portugal 

RO Romania 

RU Russian Federation 

SO Sudan 

SE Sweden 

SG Singapore 



SI Slovmia 

SK Slovakia 

SN Senegal 

SZ Swaziland 

TIJ Chac 

TG Togc 

TJ Tajik istan 

TM Turkmenistan 

TR Turkey 

TT Trinidad and Tobago 

UA Ukraine 

VG Uganda 

US Unt«:d States of A merica 

VZ Uzbekistan 

VN Viet Nam 

YU Yugoslavia 

£VV Zimbabwe 



WO 9813502A2_I_> 



WO 98/13502 



PCT/US97/17276 



METHODS TO IDENTIFY COMPOUNDS FOR 
DISRUPTING PROTEIN/PROTEIN INTERACTIONS 

Background of the Invention 
The present invention relates to a novel method to identify 
5 inhibitors of protein/protein interactions. 
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Background 

Modulation of protein/protein interactions is an attractive target 
for drug discovery and development. Potential methods by which drugs can 
regulate protein/protein interactions are numerous, including, for example, 
regulation of expression of one or more of the binding proteins, modulation 
of post-translational modification, and direct interference with the capacity of 
one protein to bind to one or more binding partners. More importantly, 
recent observations make it increasingly clear that supramolecular protein 
complexes, involving two or more binding proteins, play an important and 
essential roles in signal transduction, gene expression, cell proliferation and 
duplication, and cell cycle progression. For example, in the repair of UV 
damaged DNA, a so-called "repairsome" that contains over ten individual 
proteins is assembled into a complex which can then carry out the necessary 
repair. Likewise, gene transcription occurs through the concerted action of 
greater than twenty proteins. Signal transduction proteins, such as receptor 
protein kinases, are part of large complexes with many proteins. Contacts 
through Src homology type 2 (SH2) domains on the receptor kinases, for 
example, are noteworthy protein interaction which are part of one or more 
enzymatic cascade important for many metabolic processes. Disrupting the 
binding capacity of one or more proteins which form any of these larger 
complex is therefore an important and untapped method to control action of 
the overall complex. 

Protein/protein interactions have been discovered and 
characterized by a variety of methods: (i) standard biochemical affinity 
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methods such as chromatography or co-immunoprecipitations; (ii) gel overlay 
methods; (iii) co-purification by traditional biochemistry; and (iv) two-hybrid 
analysis [Fields and Song, Nature 340:245-246 (1989); Fields, Methods: A 
Companion to Methods in Enzymology 5: 1 16-124 (1993); U.S. Patent 5,283, 
5 173 issued February 1, 1994 to Fields, et ai]. The most recent of these 
approaches, the two hybrid method, has enjoyed broad application because of 
its relative ease of use for gene identification from cDNA fusion libraries. 
[See Chien et al. , Proc. Natl. Acad. Set. (USA) 88:9578-9582 (1991); Dalton 
andTreisman, Cell 72:223-232 (1993); and Durfee, etal., Genes and Devel 
10 7:555-569 (1993)]. 

The two hybrid system is based on targeting and identifying a 
protein/protein interaction through the use of a reporter system. The 
described two hybrid systems either use the yeast Gal4 DNA binding domain 
or the E. coli lexA DNA binding domain and couple this region to a 
transcriptional activator such as Gal4 or VP16 that drives a reporter like & 
galactosidase or HIS3. 

In principle the two hybrid assay could be used for drug 
screening. [See WO 96/03501 and WO 96/03499 ] In such a scenario, loss 
of /? galactosidase or fflS3 activity would be identified after the yeast strain 
is treated with a compound. In practice, however, use of the two hybrid 
system is technically undesirable for several reasons. In instances where the 
0 galactosidase or fflS3 protein arc employed as the reporter protein, a loss 
of activity is particularly difficult to detect because the expressed reporter 
protein is too long lived to be used in a high throughput mode. If a candidate 
binding inhibitor compound is metabolized faster than the previously expressed 
reporter protein is turned over, it is difficult to detect inhibitory action of the 
candidate drug while a reporter protein is still active. In high throughput 
screening, the loss of a positive signal, for example, /? galactosidase or IHS3 
is impossible to detect. Present robotocized screening and detection methods 
are simply not sufficiently sensitive or robust to detect loss of a signal. 
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Thus there is a need in the art to develop a rapid screening 
method that gives a positive signal, as opposed to a negative signal, when a 
protein/protein interaction is disrupted. Such a system must be capable of 
using protein interactions that are initially detected by any of the above 
mentioned approaches and must be sufficiently robust to detect a gain of 
function when a protein interaction is lost. In essence, the screening method 
must give a signal when an interaction is lost, not lose a signal when an 
interaction is lost. Such a system must be sensitive to subtle interactions, in 
particular ones that are caused by post-trans lational modification like protein 
phosphorylation. Finally for large scale screening, such as high throughput 
screening, the system must be manipulable such that a large signal-to-noise 
ration can be easily detected. 



Brief Summ ary of the Invention 
In one aspect, the present invention provides materials that are 
15 useful for the identification of compounds which inhibit interaction between 
known binding partner proteins. See Figure 1. The invention provides host 
cells transformed or transfected with DNA comprising: (i) a repressor gene 
encoding DNA binding protein that acts as a repressor protein, said repressor 
gene under transcriptional control of a promoter; (ii) a selectable marker gene 
20 encoding a selectable marker protein; said selectable marker gene under 
transcriptional control of an operator; said operator regulated by interaction 
with said repressor protein; (Hi) a first recombinant fusion protein gene 
encoding a first binding protein or binding fragment thereof in frame with 
either a DNA binding domain of a transcriptional activating protein or a 
transactivating domain of a transcriptional activating protein; and (iv) a second 
recombinant fusion protein gene encoding a second binding protein or binding 
fragment thereof in frame with either a DNA binding domain of a 
transcriptional activating protein or a transactivating domain of a 
transcriptional activating protein, whichever domain is not encoded by the first 
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fusion protein gene, said second binding protein or binding fragment thereof 
capable of interacting with said first binding protein or binding fragment 
thereof such that interaction of said second binding protein or binding 
fragment thereof and said first binding protein or binding fragment thereof 
brings into proximity a DNA binding domain and a transacting domain 
forming a functional transcriptional activating protein; said functional 
transcriptional activating protein acting on said promoter to increase 
expression of said repressor gene. 

The invention comprehends host ceils wherein the various genes 
and regulatory sequences are encoded on a single DNA molecule as well as 
host cells wherein one or more of the repressor gene, the selectable marker 
gene, the first recombinant fusion protein gene, and the second recombinant 
fusion protein gene are encoded on distinct DNA expression constructs. In 
a preferred embodiment, the host cells are transformed or transfected with 
DNA encoding the repressor gene, the selectable marker gene, the first 
recombinant fusion protein gene, and the second recombinant fusion protein 
gene, each encoded on a distinct expression construct. Regardless or* the 
number of DNA expression constructs introduced, each transformed or 
transfected DNA expression construct further comprises a selectable marker 
gene sequence, the expression of which is used to confirm that transfection or 
transformation was, in fact, accomplished. Selectable marker genes encoded 
on individually transformed or transfected DNA expression constructs are 
distinguishable from the selectable marker under transcriptional regulation of 
the ret operator in that expression of the selectable marker gene regulated by 
the tet operator is central to the preferred embodiment; i.e., regulated 
expression of the selectable marker gene by the tet operator provides a 
measurable phenotypic change in the host cell that is used to identify a binding 
protein inhibitor. Selectable marker genes encoded on individually 
transformed or transfected DNA expression constructs are provided as 
determinants of successful transfection or transformation of the individual 
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DNA expression constructs. Preferred host cells of the invention include 
transformed S. cerevisiae strains designated YI596 and YI584 which were 
deposited August 13, 1996 with the American Type Culture Collection 
(ATCC), 12301 Parklawn Drive, Rockville, Maryland 20852, and assigned 
5 Accession Numbers ATCC 74384 and ATCC 74385, respectively. 

The host cells of the invention include any cell type capable of 
expressing the heterologous proteins required as described above and which 
are capable of being transformed or transfected with functional promoter and 
operator sequences which regulate expression of the heterologous proteins also 
1 0 as described. In a preferred embodiment, the host cells are of either mammal, 
insect or yeast origin. Presently, the most preferred host cell is a yeast cell. 
The preferred yeast cells of the invention can be selected from various strains, 
including the S. cerevisiae yeast transformants described in Table 1. 
Alternative yeast specimens include S.pombe, K.lactis, P.pastoris, 
15 S.carlsbergensis and C.albicans. Preferred mammalian host cells of the 
invention include Chinese hamster ovary (CHO), COS, HeLa, 3T3, CV1, 
LTK, 293T3, Rati, PC12 or any other transfectable cell line of human or 
rodent origin. Preferred insect cells lines include SF9 cells. 

In a preferred embodiment, the selectable marker gene is 
regulated by an operator and encodes an enzyme in a pathway for synthesis 
of a nutritional requirement for said host cell such that expression of said 
selectable marker protein is required for growth of said host cell on media 
lacking said nutritional requirement. Thus, as in a preferred embodiment 
where a repressor protein interacts with the operator, transcription of the 
25 selectable marker gene is down-regulated and the host cells are identified by 
an inability to grow on media lacking the nutritional requirement and an 
ability to grow on media containing the nutritional requirement. In a most 
preferred embodiment, the selectable marker gene encodes the HIS3 protein, 
and host cells transformed or transfected with a HIS3-encoding DNA 
expression construct are selected following growth on media in the presence 
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and absence of histidinc. The invention, however, comprehends any of a 
number of alternative selectable marker genes regulated by an operator. Gene 
alternatives include, for example URA3, LEU2, LYS2 or those encoding any 
of the multitude of enzymes required in various pathways for production of 
a nutritional requirement which can be definitively excluded from the media 
of growth. In addition, conventional reporter genes such as chloramphenicol 
acetyltransferase (CAT), firefly luciferase, /3-galactosidase (0-gal), secreted 
alkaline phosphatase (SEAP), green fluorescent protein (GFP), human growth 
hormone (hGH), ^-glucuronidase, neomycin, hygromycin, thymidine kinase 
(TK) and the like may be utilized in the invention. 

In the preferred embodiment, the host cells include a repressor 
protein gene encoding the tetracycline resistance protein which acts on the ret 
operator to decrease expression of the selectable marker gene. The invention, 
however, also encompasses alternatives to the let repressor and operator, for 
15 example, E. coli trp repressor and operator, his repressor and operator, and lac 
operon repressor and operator. 

The DNA binding domain and transactivating domain 
components of the fusion protein may be derived from the same transcription 
factor or from different transcription factors as long as bringing the two 
20 domains into proximity permits formation of a functional transcriptional 
activity protein that increases expression of the repressor protein with high 
efficiency. A high efficiency transcriptional activating protein is defined as 
having both a DNA binding domain exhibiting high affinity binding for the 
recognized promoter sequence and a transactivating domain having high 
25 affinity binding for transcriptional machinery proteins required to express 
repressor gene mRNA. The DNA binding domain component of a fusion 
protein of the invention can be derived from any of a number of different 
proteins including, for example, LexA orGal4. Similarly, the transactivating 
component of the invention's fusion proteins can be derived from a number 
30 of different transcriptional activating proteins, including for example, Gal4 or 
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VP16. In one embodiment of the invention, polynucleotides encoding 
binding partner proteins CREB and CBD are inserted in plasmids pVP16- 
CREB and pLexA-CBD, respectively, which were deposited with the ATCC 
and assigned Accession Numbers ATCC 98138 and ATCC 98139, 
respectively. 

The promoter sequence of the invention which regulates 
transcription of the repressor protein can be any sequence capable of driving 
transcription in the chosen host cell. The promoter may be a DNA sequence 
specifically recognized by the chosen DNA binding domain of the invention, 
or any other DNA sequence with which the DNA binding domain of the 
fusion protein is capable of high affinity interaction. In a preferred 
embodiment of the invention, the promoter sequence of the invention is either 
a HIS3 or alcohol dehydrogenase (ADH) promoter. In a presently most 
preferred embodiment, the ADH promotor is employed in the invention. The 
invention, however, encompasses numerous alternative promoters, including, 
for example, those derived from genes encoding HIS3, ADH, URA3, LEU2 
and the like. 

In another aspect, the invention provides methods to identify 
molecules that inhibit interaction between known binding partner proteins. In 
20 one embodiment, the invention provides a method to identify an inhibitor of 
binding between a first binding protein or binding fragment thereof and a 
second binding protein or binding fragment thereof comprising the steps of (a) 
growing host cells transformed or transfected as described above in the 
absence of a test compound and under conditions which permit expression of 
said first binding protein or binding fragment thereof and said second binding 
protein or binding fragment thereof such that said first binding protein or 
fragment thereof and second binding protein or binding fragment thereof 
interact bringing into proximity said DNA binding domain and said 
transactivating domain forming a functional transcriptional activating protein; 
the transcriptional activating protein acting on said promoter to increase 
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expression of said repressor protein; said repressor protein interacting with 
said operator such that said selectable marker protein is not expressed; (b) 
confining lack of expression of said selectable marker protein in said host 
cell; ( C ) growing said host cells in the presence of a test compound; and (d) 
comparing expression of said selectable marker protein in the presence and 
absence of said test compound wherein increased expression of said selectable 
marker protein is indicative that the test compound is an inhibitor of binding 
between said first binding protein or binding fragment thereof and said second 
binding protein or binding fragment thereof. 

In a most preferred embodiment, the invention provides a 
method to identify an inhibitor of binding between a first binding protein or 
binding fragment thereof and a second binding protein or binding fragment 
thereof comprising the steps of: (a) transforming or transfecting a host cell 
with a first DNA expression construct comprising a first selectable marker 
gene encoding a first selectable marker protein and a repressor gene encoding 
a repressor protein, said repressor gene under transcriptional control of a 
promoter; (b) transforming or transfecting said host cell with a second DNA 
expression construct comprising a second selectable marker gene encoding a 
second selectable marker protein and a third selectable marker gene encoding 
a third selectable marker protein, said third selectable marker gene under 
transcriptional control of an operator, said operator specifically acted upon by 
said repressor protein such that interaction of said repressor protein with said 
operator decreases expression of said third selectable marker protein; (c) 
transforming or transfecting said host cell with a third DNA expression 
construct comprising a fourth selectable marker gene encoding a fourth 
selectable marker protein and a first fusion protein gene encoding a first 
binding protein or binding fragment thereof in frame with either a DNA 
binding domain of a transcriptional activation protein or a transactivating 
domain of said transcriptional activation protein; (d) transforming or 
transfecting said host cell with a fourth DNA expression construct comprising 



WO 98/13502 



PCT/US97/17276 



- 9 - 

a fifth selectable marker gene encoding a fifth selectable marker protein and 
a second fusion protein gene encoding a second binding protein or binding 
fragment thereof in frame with either the DNA binding domain of said 
transcriptional activation protein or the transacting domain of said 
transcriptional activation protein, whichever is not included in first fusion 
protein gene; (e) growing said host cell under conditions which permit 
expression of said first binding protein or fragment thereof and said second 
binding protein or fragment thereof such that said first binding protein or 
fragment thereof and second binding protein or binding fragment thereof 
mteract bringing into proximity said DNA binding domain and said 
transact.vating domain reconstituting said transcriptional activating protein- 
said transcriptional activating protein acting on said promoter to increase 
expression of said repressor protein; said repressor protein interacting with 
sa,d operator such that said third selectable marker protein is not expressed- 
(0 detecting absence of expression of said selectable gene; (g) growing said 
host cell ,n the presence of a test compound of binding between said first 
protein or fragment thereof and said second binding protein or fragment 
thereof; and (h) comparing expression of said selectable marker protein in the 
presence and absence of said test compound wherein decreased expression of 
sa,d selectable marker protein is indicative of an ability of the test compound 
to mh,b,t binding between said first binding protein or binding fragment 
thereof and said second binding protein or binding fragment thereof such that 
said transcriptional activating protein is not reconstituted, expression of said 
repressor protein is not increased, and said operator increases expression of 
said selectable marker protein. 

The methods of the invention encompass any and all of the 
variations in host cells as described above. In particular, the invention 
encompasses a method wherein: the host cell is a yeast cell; the selectable 
marker gene encodes HIS3; transcription of the selectable marker gene is 
regulated by the ret operator; the repressor protein gene encodes the 
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tetracycline resistance protein; transcription of the tetracycline resistance 
protein is regulated by the HIS3 promoter; the DNA binding domain is 
derived from LexA; and the transacting domain is derived from VP16. 
In another embodiment, the invention encompasses a method wherein: the host 
cell is a yeast cell; the selectable marker gene encodes HIS3; transcription of 
the selectable marker gene is regulated by the tet operator; the repressor 
protein gene encodes the tetracycline resistance protein; transcription of the 
tetracycline resistance protein is regulated by the alcohol dehydrogenase 
promoter; the DNA binding domain is derived from LexA; and the 
transactivating domain is derived from VPI6. 

In alternative embodiments of the invention wherein the host 
cell is a mammalian cell, variations include the use of mammalian DNA 
expression constructs to encode the first and second recombinant fusion genes, 
the repressor gene, and the selectable marker gene, and use of selectable 
marker genes encoding antibiotic or drug resistance markers (i.e.. neomycin, 
hygromycin, thymidine kinase). 

There are at least three different types of libraries used for the 
identification of small molecule modulators. These include: (1) chemical 
libraries, (2) natural product libraries, and (3) combinatorial libraries 
comprised of random peptides, oligonucleotides or organic molecules. 

Chemical libraries consist of structural analogs of known 
compounds or compounds that are identified as "hits" via natural product 
screening. Natural product libraries are collections of microorganisms, 
animals plants or marine organisms which are used to create mixtures for 
screening by: (1) fermentation and extraction of broths from soil, plant or 
marine microorganisms or (2) extraction of plants or marine organisms. 
Combinatorial libraries are composed of large numbers of peptides, 
oligonucleotides or organic compounds as a mixture. They are relatively easy 
to prepare by traditional automated synthesis methods, PCR, cloning or 
proprietary synthetic methods. Of particular interest are peptide and 
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oligonucleotide combinatorial libraries. Still other libraries of interest include 
peptide, protein, peptidomimetic, multiparallel synthetic collection, 
recombinatorial, polypeptide libraries. 

The utility of the various aspects of the invention is manifest. 
Host cells of the invention are useful to demonstrate in vivo binding capacity 
of both known and suspected binding partner proteins in a recombinant 
system. Such an expression system permits systematic analysis of the 
structure and function of a particular binding protein, thus permitting 
identification and/or synthesis of potential modulators of the physiological 
activity of the binding proteins. The methods of the invention are particularly 
useful to identify and improve molecules which are capable of inhibiting 
specific and general protein/protein interactions. Inhibitors identified by the 
methods of the invention can then be examined for utility in vivo as 
therapeutic and/or prophylactic medicaments for conditions associated with 
15 various protein/protein interactions. 

/ 

Description pf the Drawing 

Figure 1 describes the mechanics of the split hybrid assays. 

Detailed Descriptinn of th«> Invpntinn 
The present invention relates generally to methods designated 
split hybrid assays to identify inhibitors of protein/protein interactions and is 
illustrated by the following examples describing various methods for making 
and using the invention. In particular, Sample 1 relates to construction of 
various plasmids and expression constructs utilized in the invention. Example 
2 described generation of various yeast transformants used to identify inhibitor 
compounds. Examples 3, 4, 5 and 6 address use of the split hybrid assay to 
examine CREB/CBD binding, Tax/SRF binding, CKI/CREB binding and 
AKAP 79 binding to various partner protein, respectively. Example 7 
describe general application of the split hybrid assay. Example 8 relates to 
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use of the split hybrid assay for weakly interacting binding partners. Example 
9 describes general assay methods. Example 10 addresses use of the split 
hybrids assay to identify agents that prevent receptor desensitization and drug 
tachyphylaxis. 
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Example 1 
Plasmid Construction 

In the examples that follow, various plasmid constructs were 

utilized as described. To simplify discussion of the exemplified assays, this 

example describes construction of the various plasmids used in the following 

examples. For clarity, the plasmids are grouped according common features 

relating to their applications in the assays later discussed. 

I- Plasmids Encoding Reporter dene HIS3 
A- pRS303/lxteton-MlnT 

One copy of the tet operator sequence was engineered into 
position -53 in the HIS3 promoter of pRS313 [Sikorski, R.S. et aL, Genetics 
122:19-27 (1989)] by using the polymerase chain reaction (PCR). Two 
primary PCR reactions using pRS313 as a template were performed which 
utilized a 5'-terminal oligonucleotide designated Eco47m-5' and a 3'-inner 
oligonucleotide designated Tetop internal 3' to yield a primary 5'-PCR product 
and a 5'-inner oligonucleotide designated Tetop internal 5 ' and a 3'-terminal 
oligonucleotide designated Nhe I 3' to yield a primary 3'-PCR product. 

Eco47 m-5' SEQ ID NO: 1 

5 ' -TTGGTGAGCGCTAGG AGTCACTGCCAG 

Tetop int. 3' SHQ ID NO- 2 

25 5 ' -TATACTCTATCAATGATAG AGTA ATTCATTATGTG ATAATGCC 
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Tetop int. 5' SEQ ID NO: 3 

TACTCTATCATTGATAGAGTATATAAAGTAATGTGATTTC) 
Nhe I 3' 



SEQ ID NO: 4 
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5'-AATTCTGCTAGCCTCTGCAAAGC 



5' and 3' inner oligonucleotides contain complementary sequence such that 3' 
sequence of the primary 5' PCR product overlaps with 5' sequence of the 
primary 3' PCR product. The 5' terminal oligonucleotide contains the 
restriction site Eco47Ul while the 3' terminal oligonucleotide contains the 
restriction site Nhel in order to facilitate subsequent subcloning. The primary 
PCR reactions were performed with PJu DNA polymerase (Stratagene, La 
Jolla, CA) using reaction conditions described by the manufacturer. PCR 
products were isolated by Biol 01 (Vista, CA) Gene Clean m gel extraction. 
The primary 5' and 3' PCR products were then combined in a second PCR 
reaction and amplified using the 5'- and 3'- terminal oligonucleotides, 
Eco47m-5' and Nhe 1 3'. The second PCR reaction was performed with Vent 
DNA polymerase (New England Bioiabs, Beverly, MA) using reaction 
conditions described by the manufacturer, except that the reactions were 
supplemented with 4 mM Mg2+. The final PCR product contained one tet 
operator sequence inserted into position -53 of the HIS3 promoter and 
nucleotides 52-48 deleted in the construction. The final PCR product was 
isolated, digested with Eco47ia and Nhel and cloned into pRS313 previously 
digested with Eco47m and Nhel. The resulting plasmid was designated 
PRS313/1 xtetop. DNA sequencing confirmed the presence of one copy of the 
tet operator sequence in pRS3 13/1 xtetop and confirmed integrity of the 
EcoAim and Nhel junctions. 

AA/M restriction enzyme site was engineered into position -22 
in the HIS3 promoter of pRS3 13/1 xtetop by utilizing PCR using Vent DNA 
polymerase using P RS3 13/1 xtetop as template. One PCR construct was 
amplified using the 5' terminal oligonucleotide Eco47 m-5' (SEQ ID NO: 1) 
containing an EcoATUI restriction site and a 3 '-oligonucleotide designated Mlu 
I 3' containing a Mlul restriction site. 
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Mlu 1 3 ' SEQ ID NO: 5 

5 ' CGCACGCGTCG AAGAAATCAC ATTACTTTATATA 

A second PCR product was amplified using the 3'-terminal oligonucleotide 
Nhe I 3' (SEQ ID NO: 4) containing a Nhel restriction site and a 5'- 
oligonucleotide designated Mlu I 5' containing a Mlul restriction site. 

Mlu 1 5 ' SEQ ID NO: 6 

5'-CGCACGCGTATACTAAAAAATGAGCAGGCAAG 

The first PCR product was isolated and digested with EcoAim and Mlul, 
while the second PCR product was isolated and digested with Mlul and Miel. 
These digested products were isolated and ligatcd in a triple ligation with 
pRS313 previously digested with Eco47IH and Nhel. The resulting plasmid 
was designated P RS313/lxtetop-MluI. DNA sequencing confirmed the 
presence of the Mlul site in pRS3 1 3/lxtetop-MluI and confirmed that integrity 
of the EcoAim and Nliel junctions were maintained. 

A pRS303/lxtetop-M!uI plasmid was constructed by first 
removing the EcoAimiNhel fragment containing the altered HIS3 promoter 
from the pRS313/lxtetop-M/«I vector and ligating the isolated fragment into 
pRS303 previously digested with EcoAim and Nhel. DNA sequencing 
confirmed proper insertion of the EcoATmiNhel fragment. 

20 B. pRS303/2xtetnp-T VS? 

One copy each of the let operator sequence was engineered into 
positions -53 and -22 in the HIS3 promoter of pRS303 [Sikorski, et ai, 
Genetics 122:19-27 (1989)]. PCR was utilized to engineer one copy into 
position -53 which resulted in plasmid pRS303/lxtetop. To insert the second 
25 copy, a Mlul site was introduced at position -22 in the HIS3 promoter using 
PCR. The new plasmid was designated pRS303/lxtetop-MluI. 

The let operator was created by annealing two complementary 
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oligonucleotides tetop-1 and tetop-2. 

tetop-1 SEQ ID NO 7 

5-CGCGTACTCTATCATTGATAGAGTA; 

5 ' - ATG AG ATAGTAACTATCTC ATGCGC^ NO § 

When annealed, the tet operator sequence contains flanking Mlul sites. Both 
ohgonucleotides were phosphorated using 7* polynucleotide kinase (Gibco 
BRL, Grand Island, NY) at 37°C for one hour and annealed by first heating 
at 70°C for 10 minutes and then cooling to room temperature. The annealed 
ohgonucleotides were isolated and Hgated into P RS303/lxtetop-A//„I 
prevously digested with Mlul. The resulting p.asmid was designated 
P RS303/2xtet O p. DNA sequencing confirmed insertion of one copy of the tet 
operator sequence in the Mlul site. 

The LYS2 gene was digested from pLYS2 [Hollenberg S M 
et al., Mol. Celt. Biol. 15:3813-3822 (1995)] with Eco*l and Hiruim and the 
.solated fragment blunt ended using the large fragment of DNA polymerase 
I (G,bco BRL. Grand Island, NY). Phosphorylated SstI linkers (New England 
B.olabs. Beverly, MA) were ligated to the fragment, the fragment digested 
w.th SstI, and the resulting fragment ligated into P RS313 previously digested 
with SstI. The resulting plasmid was designated pRS313/LYS2. 

The LYS2 fragment was removed from pRS313/LYS2 with SstI 
digestion and inserted into P RS303/2xtetop previously digested with SstI. The 
resulting plasmid was designated P RS303/2xtetop-LYS2. 

Similarly, the LYS2 SstI fragment was inserted into 
P RS303/lxtetop-MIuI previously digested with SstI y i e , d pRS303/ 1 xtetop- 
MluI-LYS2. 



C DRS303 /3xtetop-LY g2 
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Two copies of the tet operator sequence were created by self- 
annealing a palindromic oligonucleotide Tctop 2x with itself. 

Tetop 2x SEQ ID NO* 9 

5'-CGCGTACTCTATCATTGATAGAGTCTAGACTCTATCAATGATAGAGTA 

The annealed oligonucleotide contained flanking Mlul sites. The 
oligonucleotide was phosphorylated, annealed, and isolated as above. The 
isolated annealed and ^-digested oligonucleotide was Iigated into 
P RS303/lxtetop-A/M-LYS2 previously digested with Mlul to yield 
P RS303/3xtetop-LYS2. The presence of two copies of the tet operator 
sequence in the Mlul site was confirmed by DNA sequencing. 



D pRS303/4xff,top-LY.S7 an d pRS10V« y teton-I.VS7 

Three or seven copies of the tet operator were created using 
PCR with Vent DNA polymerase as described above. Plasmid pUHC-I3-3 
[Grossen andBujarg, Proc. Natl. Acad. Sci. (USA) 89:5547-5551 (1992)] was 
used as template DNA using 5'- and 3'- oligonucleotides, Mlu I/Sph I 5' and 
Mlu I Sph I 3', containing an exterior Mlul restriction enzyme site nested 
internally by a Sphl restriction enzyme site. 

Mlu I/Sph 1 5' SE q j D N q. 10 

5'-GCGACGCGTGCATGCCGTCTTCAAGAATTCCTCGAG 

20 ^^ hI3 ' SEQ ID NO: 11 

5 -GCGACGCGTGCATGCCCACCGTACACGCCTACTCGA 

The PCR products were separated on an agarose gel and the ladder of 
different sized DNA fragments was isolated, digested with Mlul, and Iigated 
into the Mlul restriction site of P RS303/lxtetop-MluI-LYS2. DNA sequenc- 
25 ing revealed that either three or seven copies of tet operators were inserted 
into the Mlu site of pRS303/lxtetop-il/M- L YS2 to provide either 
pRS303/4xtetop-LYS2 or pRS303/8xtetop-LYS2. 
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E - pRS303/6xte top-LYS2 and pRS303/10xtetop -LYS7 

A Sphl restriction enzyme site was introduced at position -85 
in the HIS3 promoter of pRS303/3xtetop-LYS2 using PCR with Vent DNA 
polymerase as described. Plasmid pRS303/3xtetop-LYS2 was used as a 
template DNA. A first fragment was amplified using the 5 '-terminal 
oligonucleotide Eco47 m-5' (SEQ ID NO: 1) described above containing an 
EcoAim restriction site and a 3 '-oligonucleotide Sph I 3' containing a Sphl 
restriction site. 



S P hI y SEQ ID NO: 12 

10 5'-CATGGCATGCAAAAAAAAAGAGTCATCCGCTAGG 

A second PCR product was amplified using the 3 '-terminal oligonucleotide 
Nhe I 3' (SEQ ID NO: 4) described above containing a Nhel restriction site 
and a 5'-oligonucleotide containing a Sphl restriction site. 

S P hI5 ' SEQ ID NO: 13 

15 5 'CATGGCATGCTTAGCGATTGGC ATTATCACAT 

The PCR products were isolated as described above. The first PCR product 
was digested with Eco47m and Sphl, and the second PCR product was 
digested with Sphl and Nhel. Both digestion products were ligated in a triple 
ligation along with pRS303/3xtetop-LYS2 previously digested with both 

20 Eco47m and Nhel. The resulting plasmid was designated pRS303/3xtetop- 
SphI-LYS2. The presence of the Sphl site in pRS303/3xtetop-SphI-LYS2 was 
confirmed by DNA sequencing analysis. 

Three copies of tet operators were isolated as a single fragment 
by digesting pRS303/4xtetop-LYS2 with Sphl. The isolated fragment was 

25 ligated into the Sphl site of pRS303/3xtetop-S/^I-LYS2 to yield 
pRS303/6xtetop-LYS2. The presence of three additional copies of the tet 
operator in pRS303/6xtyetop-LYS2 at the Sphl site was confirmed by DNA 
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sequencing. 

Seven copies of tet operators were isolated as a single fragment 
by digesting P RS303/8xtetop-LYS2 with Sphl. The isolated fragment was 
Hgated into the Sphl site of P RS303/3xtetop-5MILYS2 to vield 
P RS303/10xteto P -LYS2. The presence of seven additional copies of the tet 
operator in P RS303/10xtetop-LYS2 at the Sphl site was confirmed by DNA 
sequencing. 
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F - PRS313/M1 U I and pR.S3Q3/MltiT 

A MM restriction enzyme site was engineered into position -22 
in the HISS promoter of pRS313 utilizing PGR and Vem DNA polymerase as 
noted above. Plasmid pRS313 was used as a template for these PGR 
reactions. One PGR construct was amplified using the 5' terminal 
oligonucleotide Eco47 m-5' (SEQ ID NO: 1) containing an Eco47m 
restriction site and a 3 ' oligonucleotide MIu I 3' (SEQ ID NO: 5) containing 
a MM restriction site. A second PCR product was amplified using the 3 ' 
terminal oligonucleotide Nhe I 3' (SEQ ID NO: 4) containing a Nhel 
restriction site and the 5 ' oligonucleotide Mlu I 5' (SEQ ID NO: 6) containing 
a MM restriction site. The first PCR product was isolated and digested with 
Eco47m and MM, while the second PCR product was isolated and digested 
with MM and Nhel. The digested products were partially purified and joined 
in a triple ligation with pRS313 which had been previously digested with 
Eco47m and Nhel. The resulting plasmid was designated P RS313/MluI. 
DNA sequencing confirmed the presence of the MM site in P RS313/Mlul and 
to confirm the integrity of the Eco47m and Nhel junctions. 

P RS303/MluI was constructed in exactly the same manner as 
pRS313/MluI except that pRS303 was used in place of pRS313. 

G. pRS313/lxtpm p 

See above wherein P RS3l3/lxtetop is an intermediate in the 
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constniction of pRS303/lxtetop-MluI. 

H - pRS313/MlnT-lxtemp a nH D RS3m/TvTl.,T-1 vt^p 

One copy of the tet operator sequence was created by annealing 
two complementary oligonucleotides tetop-1 and tetop-2 (SEQ ID NO: 7 and 
SEQ ID NO: 8). The annealed tet operator sequence contains flanking MM 
sites. The oligonucleotides were phosphorylated using T4 polynucleotide 
kinase (Gibco BRL, Grand Island. NY) at 37»C for one hour and annealed by 
first heating at 70»C for 10 minutes followed by cooling to room temperature. 
The annealed oligonucleotides were isolated and ligated separately into MM- 
digested P RS313/MluI and P RS303/MluI, the resulting plasmids being 
designated P RS313/MluI-lxtetopand P RS303/MluI-lxteto P . DNA sequencing 
confirmed the presence of one copy of the tet operator in the MM sites of 
both plasmids. 

In order to produce plasmids bearing multiple copies of the tet 
operator, annealed oligonucleotides described above were ligated together 
overnight at 16°C. After isolation of the ligation products, they were inserted 
into the MM of P RS313/MluI. DNA sequencing analysis confirmed that one 
clone, P RS313/MluI-4xtetop, was produced which contained four copies of tet 
operator in the MM site. However, upon further examination of this clone 
it was discovered that it had been subjected to a recombination event and was 
therefore not useful for further cloning steps. Continued attempts to insert 
multiple copies of the tet operator into the MM site of pRS313/MluI by 
ligating multimers of the tet operator have been unsuccessful. 

*• PRS313/lxfftfr.p .MI..T 

See above wherein construction of P RS313/lxtetop-MluI was 
an intermediate in the construction of P RS303/lxtetop-MluI. 
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J - PRS3 13/2x^1^ 

One copy of the ,„ operator sequence was created using 
anneal commentary oligonucleotides tetop-1 and teto P -2 (SEQ ID NO- 
7 and SEQ ID NO: 8). Annealed oligonucleotides were iigated into the Mluf 
sue of P RS313/lxtetop-MluI to yield P RS313/2xtetop. DNA sequencing 
confirmed the presence of two copies of the tet operator in the Mlul site. 

K pRS301/7vr^ r 

See above wherein P RS303/2xteto P was an intermediate in the 
construction of pRS303/2x/tetop-LYS2. 

L PRS313/T VQ 2 and p TtWi/j yep 

The LYS2 g ene was digested from pLYS2 with EcoXI and 
""din digestion. The EcolWHintm fragment was blunt ended using the 
arge fragment of DNA polymerase I (Gibco BRL, Grand Island, NY) and 
I.gated with phosphorylated Sstl linkers (New England Biolabs, Beverly MA) 
The resulting fragment was digested with Sstl and ligated into P RS3I3 
Piously digested with Sstl. The resulting plasmid was designated 
PRS3I3/LYS2. Because the LYS2 fragment was shown to have inserted into 
PRS3I3 In both orientations, pla smids with the LYS2 gene in both orientations 
were transformed separately into the yeast strain SEY6210* ( MAT a leu2- 
3, 112 uraS-52 HisS-^OO trpl-Ml l ys2 -801 suc2-A9 fRobins'on et al Mol 
Ceil. Biol. 8:4936-4948 (,988),. Both clones alloweo the yeast to grow'in the 
absence of lysine indicating that orientation of the LYS2 gene in pRS3,3 did 
not affect the expression of an active gene. 

ThC LYS2 fragmenl was reraov «i from pRS313/LYS2 with Sstl 
& and hgated into the Sstl site of: 

pRS313/lxtetop-MIuI giving plasmid P RS313/lxtetop-MluI-LYS2, 
pRS3 1 3/2xtctop giving plasmid P RS3 1 3/2xteto P -LYS2 
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pRS303/Ixtetop-MluI giving plasmid pRS303/lxtetop-MluI-LYS2, and 
pRS303/2xtetop giving plasmid pRS303/2xtetop-LYS2. 

n - Plasmids Encoding R eporter Hoha 
A - PRS3Q6/HT5;3;TetR/Tftrtn 

The 5' promoter sequence of the yeast HIS3 gene, 
encompassing nucleotides -75 to +23, was ligated to the translation^ start of 
TetR. In addition, the DNA sequence encoding the simian vims 40 (SV40) 
large T antigen nuclear localization signal was ligated in frame with the 
nucleotide sequence encoding the last amino acid residue of TetR. The 
chimeric fragment was created by the same PCR strategy as described above. 

The HIS3 promoter fragment, the primary 5' -PCR product, was 
amplified by PCR from plasmid p601 rGrueneberg,D.A., Science 257:1089- 
1095 (1992)] using a 5 '-terminal oligonucleotide T7 Promoter primer and a 
3'-inner oligonucleotide 3'-TetR inner primer. 

15 17 Promoter primer SEQ ID NO- 14 

5 ' -TAATACGACTC ACTATATAGGG 

3'-TetR inner primer SEQ ID NO- 15 

5 ' -TCTAG ACTTTGCCTTCGTTTATC 
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The primary 3' PCR product containing the TetR coding sequence was 
amplified from pSLF104 [Forsburg, Nucl. Acid. Res. 21:2955-2956 (1993)] 
with a 5'-inner oligonucleotide 5'-TetR inner primer and a 3'-terminal 
oligonucleotide 3 '-TetR terminal primer. 



5'-TetR inner primer SEQ ID NO- 16 

5'-CGAAGGCAAAGATGTCTAGATTAGATAAAAG 

25 3'-TetR terminal primer SEQ j^q. j 7 

5'-CGCGGATCCGCTTrCTCTTCTTTTrTGGAGACCCACTTTCACATTTAAG 
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An £«,ri site derived from the p60] fragmem a Bamm s . te ^ ^ ^ 
termmal oligonucieotide were used in subsequent subcloning The PGR 
products were gel-purified and amplified in a second PGR reaction with 5'- 
and 3-' terminal oligonucleotides, T7 Promoter primer (SEQ ID NO- 14) and 
5 3<-TetR terminal primer (SEQ ID NO: 17). The secondary PGR product was 
•solated, digested with EcoKI and BamHl, and ligated intQ pRS306/Tmn 
previously digested with ScoRI and Bamm. The resulting piasmid was 
designated pRS306/ffl S 3:TetS/Ter m which comprises the complete TetR 
codmg sequence in frame with sequences encoding the nuclear localization 
10 signal of SV40 large T antigen. 

B pRS3l6/m.^-T», P / Tmi1 

The construction protocol for this plasmid was the same as 
descnbed above for subcloning a HISS DNA into P RS306/Term except chat 
the vector for subcloning was P RS316/Term described above. 

15 C PRS306/1 yT .xAn p /m^ T 0 ,D 

Oligonucleotides LexAop (100a) and LexAop (100b) containing 
a smgle copy of LexA operator were phosphorylated with T4 polynucleotide 
kmase (Gibco BRL, Grand Island, NY) at 37°C for one hour 
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LexAop (100a) 

S'-aattgctcgagtactgtatgtac/tacTg^ag 8 

LexAop (100b) SEn 
5-AATTCTACTGTATGTACATACACTACTCGAGC 

Following phosphorylation, ,he oligonucleotides were annealed by heatin. a. 
70°C for .0 minutes (Mowed by cooling to room temperature. The annealed 
o.,g„nucleo,,de confining 5 - and 3 • £c„RI overhanging ends was subclone* 
■mo P RS306/HIS3 : Te.Rn-enn previously digested with EcoVl. The number 
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of copies of inserted oligonucleotide was confirmed by DNA sequencing The 
plasm.d containing a single copy of the LexA operator was designated 
pRS306/ 1 xLexAop/HIS3 :TetR. 

D PRS3 1 6/7vT j-xAop/WTS-VTWP 

The subcloning protocol for this construct was the same as 
described above for P RS306/lxLexAop/HI S 3:TetR. The annealed 
ohgonucleotides encoding the LexA operator included overhanging EcoKl ends 
and dunng ligation, the individual annealed fragments were ab.e to 
mutamenze, inserting into the parental plasmid more than one copy of the 
des.red LexA sequence. The number of copies of inserted oligonucleotides 
was confirmed by DNA sequencing. 

E - PRS306/3*T / acAop /ms vtVp 

A DNA fragment containing two copies of LexA operator and 
the chimeric HIS3:T*tR reporter was excised from 
pRS316/2xL e xAop/HIS3:TetR by digestion with Kpnl and Bamm restriction 
enzymes. The fragment was gel-purified and subcloned into P RS306/T e rm 
prev.ously digested with Kpnl and BamUl and the resulting construct was 
sequenced to confinn the presence of two copies of the LexA operator. 

F- PRS306/4xLexAop/HIS3:TetR 

and pRSlOft/Sxr/vAnp/w^^.T^p 

A pair of ohgonucleotides SH101 A and SH101B were utilized 
•n PCR to amplify the LexA binding site multimer from the plasmid SH18- 
34AS P e [Hollenberg, S.M., etal., Mol.CellBioL 15:3813-3822 (1995)1 
SH101A 

5'-CCGGAATTCTCGAGACATATCCATATCT^!?c 20 

^ V CCGGAa™, SEQID N0:21 
5 -CCGGAATTCACTAATCGCATTATCATC 
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The amplification product containing four copies of LexA operator was gel 
purified, digested with EcoKL, and subcloned into pRS306/HIS3:TetR/T 
previously digested with EcoW. The number of LexA operators v/ere 
determined by DNA sequencing. 

G - PRS306/SxT ^xAop /HTSl- Tp>P 

A PCR strategy was used to link the 5' promoter sequence of 
the yeast HIS3 gene encompassing nucleotides-75 to +23 to the translations 
stan of TetR. Sequences encoding the SV40 large T antigen nuclear local- 
ization signal were fused in frame with the nucleotide sequence encoding the 
last amino acid residue of TetR. The PCR product was digested with £coRI 
and BamiH and inserted into pRS306/Tenn previously digested with EcoRI 
and Bamm. The resulting plasmid was designated pRS306/HIS3:TetR/Term, 
and was shown to encode the complete TetR protein in frame with the nuclear 
localization signal of SV40 large T antigen. The fusion protein is followed 
15 by four amino acids generated by the vector backbone (Arg-Ile-His-Asp). 

The LexA binding site multimer from the plasmid pSH18- 
34ASpe [Hollenberg, S.M. etal., Mol. Cell. Biol. 15:3813-3822 (1995)] was 
amplified by PCR, digested with £coRI, and subcloned into the EcoW site of 
pRS306/HIS3:TetR/Term resulting in plasmid pRS306/8xLexAop/TetR. 

20 H. pADH/TetR 

The DNA coding sequence of TetR was amplified by PCR from 
PSLF104 using two oligonucleotides, NcoI-TetR and 3'-TetR terminal primer 
(SEQ ID NO: 17). 



NcoI-TetR SEQ ID NO: 22 

25 5 '-CATGCCATGGCCATGTCTAGATTAGATAAAAG 
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The resulting product was gel-purified, digested with Ncol and BamHI, and 
subcloncd into a pBTM116 [Bartel, et al., in Cellulnr 
Deyejo pment: a Practical Approach , Hartley (ed.), IRL Press; Oxford, pp. 
153-179 (1993)] shuttle vector containing an ADH promoter, previously 
digested with Ncol and BamHI. For construction of this vector DNA 
generated by PCR and DNA obtained by restriction enzyme digestion of the 
polylinker region in plasmid pBluescript (Stratagene, La Jolla, California) 
were used to engineer additional restriction sites 5 ' and 3 ' of the ADH 
promoter. The TetR prote in encoded from this construct is expressed 
containing additional amino acids Met^-Ala' before the initiating methionine 
and also contains the nuclear localization signal of SV40 large T antigen 
located after the last amino acid of TetR as described above. 
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l - nRS306/An H;TetR/Ter™ 

A fragment encoding the ADH promoter and TetR was removed 
from plasmid pADH/TetR with Xhol and blunted-ended with the large 
fragment of DNA polymerase I (Gibco BLR, Grand Island, NY). EcoXl 
linkers (New England BioLabs, Beverly, MA) were added and the fragment 
was digested with EcoXl and BamHI. The resulting fragment was gel-purified 
and ligated into pRS306/Term previously digested with Ecom and BamHI. 

J- pRS306/4xLexAop/ADH::TetR 

and DRS306/S x Le X Anp/ADH::TfttR 

The subcloning protocol used to insert multiple copies of the 
LexA operator into pRS306/ ADH : TetR/Term was the same as described 
previously for pRS 3 06 14 x Le x A op / H I S 3 : Te t R and 
25 pRS306/8xLexAop/HIS3:TetR. 
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m - Plasmids Encoding Binding Proteins 

A. DLexA-CBD 

A DNA fragment containing the CREB binding domain of CBP 
(CBD), amino acids 461-682, was PCR amplified from plasmid CBP-0.8 
5 [Chrivia, J.C. et al., Nature 365:855-859 (1993)] using a pair of 
oligonucleotides designated 5' CBD primer and 3' CBD primer. 

5 S?2 Primer SEQ ID NO: 23 

5 -GCGAATTCGCCAGGGCAACAGAATGCCACT 
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3' CBD primer SE q id nq . 24 

5'-CGGGATCCTGGCTGGTTACCCAGGATGCCTTG 



Following gel purification, the amplification product was digested with EcoW. 
and BamHI, and ligated into plasmid pBTM116 fBartel, et al., in Cellular 
Interactions in Development: a Practical Approach, (ed) Hartley, D.A. (IRL 
Press, Oxford), pp. 153-179 (1993)] previously digested with EcoW and 
15 BamHI. 



B. DVP16-CBD 

A DNA fragment encoding the CBP sequence was excised from 
pLexA-CBD by digestion with EcoW. and BamHI. Plasmid pLexA-CBD was 
linearized with EcoW digestion, the resulting overhanging ends blunt-ended 
using the Klenow fragment of DNA polymerase I, and the ends ligated with 
BamHI linkers. The resulting fragment was inserted into pVP16 [Hollenberg, 
etal.. Mol. Cell. Biol. 15:3813-3822 (1995)] previously digested with into 
BamHI. 
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C. PVP16 PRPR 

Plasmid pcDN A3/ CREB2 83 [Sun and Maurer, J. Biol. Chem. 
270:7041-7044 (1995)], containing the VPI6 transaction domain fused to 
sequences of the rat CREB transactivation domain (1 to 283 aa) was linearized 
5 with Xhol and BamHI linJcers (New England BioLab) ligated to the resulting 
blunt-ended Xhol sites. DNA encoding the VP16/CREB chimeric protein was 
removed with HindW and BamHI digestion and following gel purification, 
ligated into the Hindm and BamHI sites of pVP16 which encodes the LEW 
gene. 

10 D dVPI 6-rRP RfBgin-Sacn^-T ^7 

A DNA fragment encoding £-galactosidase was PCR amplified 
from plasmid pSV-0-galactosidase vector (Promega, Madison, WI) using a 
pair of oligonucleotides, 5 ' 0-gal primer and 3 ' /3-gal primer and inserted into 
the Notl site of pVP16 to produce pVP16-LacZ. 

15 _ . 5 ' 13 S* 1 P rimer SEQ ID NO: 29 

5 -ATGGTACCAGCGGCCGCTAGTCGTTTTACAACGTCGTGAC 

3 ' 0-gal primer SEQ ID NO: 30 

5 -ATGGTACCGCGGCCGCTTATTTTTGACACCAGACCAAC 
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A PCR fragment containing CREB sequences encoding amino acid residues 
1 to 283 was amplified from plasmid pRSV-CREB34I [Kwok, et al., Nature 
380: 642-646 (1996)] using a pair of oligonucleotides, 5 * CREB 341 primer 
and 3 ' CREB 283 primer, and inserted into pVP16-LacZ vector at the BamHI 
site. 

5 ' CREB 341 primer SEQ ID NO: 25 

5 ' -CGCGGATCCGGATGACC ATGG ACTCTGG AG 

3 CREB 283 primer SEQ ID NO- 28 

5-CGCGGATCCGTGCTGCTTCTTCAGCAGGCTG 
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To generate a cassette vector for producing and subcloning mutated CREB 
sequences as described below, PCR was used to engineer a Bgtn site using 
oligonucleotides 5' Bglll primer and 3 ' Bglll primer, at nucleotides 273 to 
278 and a Sacll site using oligonucleotides 5' SacJl primer and 3' SacU 
5 primer at nucleotides 500 to 505 of the CREB activation domain. 

5 ' Bglll primer SEQ ID NO- 3 1 

5 -CGGAGATCTAAAGAGACTTTTCTCCGGAACTCAG 
3 ' Bg m primer SE q id nq . 32 

5 -CGGAGATCTTTACAGGAAGACTGAACTGT 

10 5 SacI1 P rim er SE q id nq . 33 

5 -CCACCGCGGCAGTGCCAACCCCGATTTAC 

3 ' SacU primer SEQ ID NO: 34 

3 '-CATCCGCGGTGGTGATGGCAGGGGCTGA 
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E. pLexA-CRFR 283 

A DNA fragment containing the rat CREB transactivation 
domain (amino acids 1 to 283) was excised from pcDNA/CREB283 [Sun and 
Maurer, supra] with Smal and Xbal digestion. The 5 ' Xbal site was blunt 
ended with the large fragment of DNA polymerase I (Gibco BRL, Grand 
Island, NY) and Sail linkers (New England Biolabs, Beverly, MA) added. 
The fragment was digested with Sail and subcloned into the Sail site of 
pBTM116. 



F. pLexA-CRFR 74 1 

A DNA fragment containing the rat CREB 341 cDNA was 
amplified by PCR from pcDNA/CREB34I [Kwok, supra] using a pair of 
25 oligonucleotides, 5 ' CREB 341 primer (SEQ ID NO: 25) and 3 ' CREB 341 
primer. 
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3 ' CREB 341 primer SEQ ID NO- 26 

5 -CGCGGATCCTTAATCTGACTTGTGGCAGTA 

After gel purification, the PCR product was digested with BamHl, and 
subcloned into the BamHl site of pBTM116. 

G. pLexA-CRFR 341 -Ml 

A DNA fragment containing the rat CREB sequence with a 
mutation changing serine at position 133 to alanine was amplified by PCR 
from plasmid Rc/RSV CREB-M1 [Kwok, a al.. supra] using the same set of 
primers as described for pI^xA-CREB 341, 5 ' CREB 341 primer (SEQ ID 
NO: 25) and 3' CREB 341 primer (SEQ ID NO: 26). The resulting 
amplification product was gel-purified, digested with BamHl, and subcloned 
into the BamHl site of pBTMl 16. 

H. PVP16-CPKR mi 

A PCR fragment containing CREB sequences coding for amino 
acid residues 1 to 283 including the serine 133 mutation to alanine was 
amplified using a pair of oligonucleotides, 5 ' CREB 283 primer and 3 ' CREB 
283 primer (SEQ ID NO: 28). The PCR fragment was gel-purified, digested 
with BamHl and inserted into the BamHl site of pVP16. 

5 CREB 283 primer $Eq jd N q- 27 

5 -CGCGGATCCCCATGACCATGGAATCTGGAGCC 

I. pLexA-SRF 

A DNA fragment containing human SRF was excised from 
plasmid pCGN-SRF [Grueneberg, D.A., et al., Science, 257:1089-1095 
(1992)] with Xhol and BamHl digestion. The Xhol site of the fragment was 
blunt-ended by the large fragment of DNA polymerase I (Gibco BRL, Grand 
Island, NY), ligated with BamHl linkers, digested with BamHl. and inserted 
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into pBTM116 previously digested with BatnHl. 

J. pVPI6-Tax 

A DNA sequence encoding full length Tax protein was excised 
from pS6424 [Kwok, R.P.S., etal. Nature 380:642-646 (1996)] with BainHl 
5 digestion and was inserted into pVP16 previously digested with BamHI. 

Plasmirfs For Binding Protein Controls 

A. pLeu 

Plasmid pVP16 was digested with HindHI and BamHI to 
remove the fragment encoding the VP1 6 transactivation domain. The digested 
vector was blunt-ended and self-ligated. 



10 



B. pLexA-VP16 

The VP16 transactivation domain was PCR amplified from 
pGaI-VP16 [Sadowski, et at., Nature 335:563-564 (1988)] with a pair of 
oligonucleotides, 5 -VP16SH and 3 VP16SH and the resulting amplification 
15 product was digested with Clal, blunt-ended, and inserted into pBTMl 16. 



5 -VP16SH SEQ ID NO: 35 

GGCTATCGATACGGCCCCCCCGACCGAT 

3 "-VP16SH SEQ ID NO: 36 

GC GTATC G ATCTAC CC ACCGTACTCGTC 



20 C. pT-P.xA-Tjiipin 

See Hollenberg, S.M. et aL, Mol.Cell.Biol. 15:3813-3822 

(1995)]. 
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v - Plasmids Encoding Reporter Gene Controls 
A. pRS306/Term 

The alcohol dehydrogenase (ADH) terminator sequence was 
excised from plasmid pBTM116 [Bartel, et al., in Cellular Interactions in 
5 Development: a Practical Approach, (ed) Hartley, D.A. (IRL Press, Oxford), 
pp. 153-179 (1993)] with Sphl and Pstl restriction enzymes and both 3'- 
ovcrhanging sequences were blunted by T4 DNA polymerase (Gibco BLR, 
Grand Island. NY). The fragment was gel-purified and subcloned into the 
blunt-ended Notl site in pRS306 [Sikorski and Hieter, Genetics:122: 19-27 
10 (1989)J. The orientation of inserted fragment was determined by DNA 
sequencing. 



B. PRS316/Term 

The subcloning protocol for inserting the ADH terminator 
sequence into pRS316 was the same as described for inserting the ADH 
15 sequence in pRS306. 

Example 2 

Generation of Yeast Assay Transformant 

Selection of an appropriate yeast assay strain is an empirical 
determination based on growth characteristics of the transformed alternatives. 

20 A general method to make the appropriate selection is described as follows. 

Candidate yeast assay strains were transformed individually with 
reporter gene constructs and/or a plasmid encoding one of the experimental 
binding proteins. Assay strains thus transformed were then compared for 
relative differences in growth characteristics, with an optimal assay strain 

25 showing negligible growth on media lacking histidine and vigorous growth on 
media containing histidine. In practical application of this first step in selection 
using various plasmids transformed into assay strain Y1584, the following 
results were observed. 
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When the piasmid pLexA-VP 1 6 encoding both the LexA DNA 
binding domain and the VP16 transacting domain as a single protein was 
introduced into the assay cells, growth in the absence of histidine in the media 
was significantly reduced three days after transformation. 

In assays including transformation with plasmids encoding 
multiple copies of the tet operator upstream of the HISS gene, the following 
plasmids were separately utilized: 

P RS303/lxtetop-W5 (encoding a single tet operator sequence), 
P RS303/2xtetop-W5 (encoding two tet operator sequences). 
pRS303/3xtetop-7//S (encoding three tet operator sequences), 
P RS303/4xtetop-/ffS (encoding four tet operator sequences), 
P RS303/6xtetop-///S (encoding six tet operator sequences), 
P RS303/8xtetop-W5 (encoding eight tet operator sequences), or 
pRS303/10xtetop-///S (encoding ten tet operator sequences). 



In the assay strains transformed with plasmids encoding either one, two, or 
three copies of the tet operator upstream from the HIS3 gene, cells grew on 
media lacking histidine at a rate similar to cells grown on media containing 
histidine. In yeast assay strains transformed with plasmids encoding either 
six, eight, or ten copies of the tet operator upstream from the HIS3 gene, cell 
growth was low suggesting that these strains would not be useful in assays to 
examine binding and interruption of binding between test proteins. These 
results suggested that, in assay strains transformed with a reporter piasmid 
having more than three tet operator sequences upstream from the HIS3 gene, 
normal activity of the HISS promoter is disrupted and that these plasmids 
25 would not be useful. 

In assays wherein yeast cells were transformed with only 
reporter plasmids (and not plasmids encoding binding partner fusion proteins) 
encoding multiple copies of the LexA operator 5' of the TetR gene, the 
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following results were observed. Growth of assay cells transformed with 
plasmids bearing one, two, four, and eight copies of the regulatory LexA 
operator upstream of the TetR gene appeared to be "copy number" dependent. 
Yeast cells transformed with plasmids having two copies of the LexA operator 
grew at a rate significantly higher than those assay cell transformed with a 
piasmid bearing only one copy of the operator. Cells transformed with 
plasmids encoding either four or eight LexA operators upstream of the TetR 

gene grew at an approximately equal rate, and better than assay cells bearing 

a TetR gene driven by two copies of the operator. 

When the alcohol dehydrogenase (ADH) promoter was included 

upstream of the LexA operator (plasmids encoding either four or eight LexA 

operators) in the various reporter gene constructs, cell viability was the 

lowest. 

The various cell lines constructed by the methods described 
above are shown in Table 1, wherein various transformed yeast strains are 
identified (Strain ff) along with the number of LexA operator sequences in the 
piasmid encoding TetR, the number of tetracycline operator sequences 
regulating expression of HIS3, and relative growth rate of the transformed 
strain on media containing histidine. It is important to note that growth 
variation of transformed cells in media containing histidine is observed, even 
in cell lines identically transformed. The number of " + " signs in Table 1 is 
indicative of the host cell's relative abiJity to grow on media lacking histidine 
in the absence of transformation with plasmids encoding potential binding 
proteins. Also in Table l, a subscript "a" is indicative of transformation with 
a piasmid bearing the alcohol dehydrogenase promoter; absence of a subscript 
"a" indicates use of the HIS3 promoter. 
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Table 1 

VarioiLS Yeast Tranqfrmmanrc 
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Diploids L40 

Strain # LexA TclOp Hix + 
YI579 IX 2X 4 + + 
YI58I IX 2X 44-4 



YI580 
YI532 



2X 
2X 



2X 
2X 



4 + 4 
4 + + 



Diploid* L40 

Strain 8 LexA TetOp His4 
YI583 4X 2X 4 + h 
YI585 4X 2X + 4-4 
YI5R7 4X 2X 4 4-4 
YI58Q 4X 2X + 4-4 



YI584 


8X 


2X 


*+■■+* + 


YI580 


8X 


2X 


4- + 4 


YI588 


8X 


2X 


4 4- 4 


YI590 


8X 


2X 


+ 4- + 


Diploids 


L40 






Strain # 


LexA TctOp His + 


YI59I 


2X 


2X 




YI594 


2X 


2X 


4-4 4 


YI597 


2X 


4X 




YI633 


2X 


4X 


+ 


Y16.1to 


2X 


4X 


+ 4- 


YI600 


2X 


6X 




Y1606 


2X 


6X 


4- 


YI630 


2X 


6X 


4- 


YI627 


2X 


6X 


+ 4 + 


Y1603 


2X 


10X 


4 


YI62I 


2X 


I0X 


4- 4- 


YI609 


2X 


10X 


+ 


YI624 


2X 


I0X 


+ 4 


YI593 




2X 


4- + + 


YI595 


4X 


2X 


+ 4-4- 


YI599 




4X 




YI634 


4X 


4X 


+- 


YI638 


4 *n 


4X 


4 



Diploids L40 




Strain # 


LexA TeiOp His 4 


YI602 


4X a 6X 




YI607 


4X 6X 


+ + 4- 


YI628 


**A OX 


+ + -1 


YI632 


4X :i 6X 




YI60S 


4X n 10X 




YI6I0 


4X 10X 


4 


YI622 


4X J OX 


4 -t 


YI62b 


4X n 10X 


4 + 


YI592 


8X 2X 


44 4 


Y!59b 


8X n 2X 


4 4 4- 


YI59R 


8X 4X 


4 


YI635 


8X n 4X 


4 


YI637 


8X 4X 


4- 4 


YI60I 


8X 6X 


4 


YI608 


8X Q 6X 


4 


YI629 


8X ft 6X 


4 4 4 


YI631 


8X 6X 


4-4 4 


YI604 


8X I0X 


4 


YI6II 


8X D I0X 


+ 


YI623 


8X a I0X 


4 4 


YI625 


8X I0X 


4- + 



Strain* Lex A TclOp strain His 4 

YI664 4X a 3X w303(50) 4 + + 

Y1660 4X 0 3X w303(5l) 4 44 

YI668 4X n 2X L40 (69) + + 4 

Y1670 4X a 2X L40(70> + + + 

YI665 8X a 3X w303(50) + 4- + 

YI667 8X Q 3X w303{5l) + + + 

YI67I 8X a 3X L40(69) + + 4 

YI669 8X a 2X L40 (69y + 4-4 

YI67I 8X 3 2X L40f70) 4- + 4 

YI67I 8X a 6X L40(69) + + 4 
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Example 3 
CREB/CBP Binding Interaction 

Use of the split-hybrid assay for studies of protein/protein 

binding wherein one of the binding components is randomly mutagenized was 

carried out using CREB and CBP binding proteins. The binding of CREB to 

CBP has been shown to require the phosphorylation of the CREB serine 

residue at position 133 in a region designated the "kinase-inducible domain" 

(KID) [Chrivia, et al., Nature 365, 855-859 (1993); Kwok, et al., Nature 

370, 223-226 (1994)]. Functionally, changing serine at position 133 to 

alanine (a mutant designated CREB-MI) abolishes the ability of CBP to 

activate CREB-mediated transcription. Preliminary studies have indicated that 

the CREB-MI mutant in the split-hybrid system prevents the interaction with 

CBP and subsequent growth of the yeast assay strain on media lacking 

histidine. Precisely what other portions of the KID of CREB are required for 

binding to CBP is unknown, however. To define other potentially important 

amino acid residues, the KID (amino acid residues 102 to 160) of CREB 341 

was randomly mutagenized using PCR. 

A - PCR Mutagenesis and Creation nf Mutant Library 

The technique used for mutagenic PCR was a modification of 
20 that described by Uppaluri and Towle [Mol. Cell. Biol. 15, 1499-1512 
(1995)]. The reaction mixture contained 20 ng of pVP16-CREB(BgUI-Sacn)- 
LacZ, 16 mM (NH 4 ) 2 S0 4 , 67 mM Tris-HCl, pH 8.8, 6.1 mM MgCl 2 , 0.5 
mM MnCl 2 , 6.7 ^M EDTA, 10 mM 0-mercaptoethanol, 1 mM primers, ImM 
each dGTP, dTTP, and dCTP, 400 uM dATP. and 2.5 units of Tag DNA 
polymerase (Promega, Madison, WI). After seven cycles of PCR (94°C for 
40 sec, 50»C for 40 sec, and 72"C for 40 sec), the PCR product was 
amplified a second time using the same primers and Vent DNA polymerase 
(New England BioLabs, Beverly, MA) under the same conditions for 25 
cycles. The resultant PCR product was gel purified, digested with Bglll and 
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SacTL, and inserted into the BgKL and Sacll sites of pVP16-CREB(BglII-SacII)- 
LacZ (construction of which is described above). The resulting plasmids were 
transformed into DH5or bacterial cells. Transformants were pooled and 
plasmid DNA was isolated by CsCl gradient centrif ligation. 

B Construction and TT«» of DVP16-CR FR fP p in-SacnVLacZ 

A DNA fragment encoding the 0-galactosidase gene was fused 
in frame to the carboxyl-terminal end of VP1 6-CREB as described above. 
The carboxy-terminal tag allowed identification of clones that contain frame- 
shift and nonsense mutations; colonies that remain positive for /8-galactosidase 
were presumed to contain an open reading frame throughout the mutated 
region. To facilitate the subcloning of mutated sequences, a cassette version 
of the CREB cDNA was generated that contained BgtR and a SacU sites 
flanking the 5 ' and 3 ' ends of the KID, respectively. These modifications 
altered the amino acid residue at position 168 from valine to alanine. The 
cDNA altered in this manner was indistinguishable from the original VP16- 
CREB and from VP1 6-CREB-LacZ when tested in the split hybrid assay. 
Primers complementary to regions flanking the KID were used in mutagenic 
PCR amplification reactions as described above under conditions which were 
optimized to achieve one to three mutations in the 177 bp region encoding the 
KID. PCR products were introduced into pVPl 6-CREB(Sgm-SacII)-LacZ in 
place of wild-type sequence. A library of mutated sequences was transformed 
into yeast assay strain YI584 expressing LexA-CBD. Approximately 27,000 
yeast transformants were screened, yielding about 5,000 colonies that were 
capable of growing on selective media supplemented with 10 Atg/mi of 
tetracycline and 1 mM of 3 AT, determined as described below. 

Two screening steps were performed to eliminate uninformative 
mutations and false positives. First, filter /3-galactosidase assays were 
performed by standard methods [Vojtek, et al.. Cell 74:205-214 (1993)] on 
the 5,000 colonies which exhibited positive growth on media lacking 
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tryptophan, histidine, uracil, leucine, and lysine to eliminate expressed 
proteins having frame-shift and nonsense mutations. Five hundred thirty six 
colonies developed a dark blue color, whereas 412 colonies turned white and 
were presumed to express mutants containing either frame-shift or nonsense 
5 mutations. The other colonies developed a pale blue color, and control 
experiments suggested that these colonies may have expressed unstable lacZ 
fusion proteins. Pale blue colonies were not analyzed further. 

DNA from 536 dark blue colonies was isolated and transformed 
into E.coli MCI 066 cells. One hundred ninety three pVP16-CREB-(BgUI- 
10 SacII)-LacZ cDNAs were then isolated. 

In a second screening step, the 193 cDNAs were separately re- 
transformed along with pLexA-CBD into the split-hybrid strain as well as into 
the two-hybrid L40 strain [Vojtek, et aL, supra] in order to identify false 
positives and confirm that the mutant CREB proteins did not interact with 
15 CBP. Among the 193 cDNAs re-screened, 152 did not interact with CBP in 
the yeast two-hybrid system, 15 interacted weakly, and 26 interacted like wild 
type CREB. 

Following these two screening steps, the 152 CREB mutants 
were sequenced. Seventy CREB mutants were found to contain a single 
amino acid change. Sixty four CREB mutants contained two amino acid 
residue mutations and 13 mutants contained more than two amino acid 
mutations. Mutants containing more than one amino acid alteration were not 
analyzed further. The expression level of mutant proteins having one amino 
acid change were determined using a standard 0-galactosidase assay. 

The CREB mutations identified in the split-hybrid screen were 
shown to carry amino acid changes centered around the phosphorylation site 
at serine at position 133. No disrupting mutations were found to contain 
amino acid alterations outside of the region between amino acids 130 to 141. 
Most of the mutations abrogated the PKA phosphorylation region, but others 
were identified at isoleucine position 137, leucine at position 138, and leucine 
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at posuaon 141. The mutations at positions 137, 138, and 141 generally 
changed the hydrophobic residues at these positions to polar residues The 
abd.ty of the split-hybrid system to detect only a iimited number of CREB 
mutants, many of which have been proposed previously to disrupt CREB 
5 association with CBP [Parker, ei al. , Mol. Cell. Biol. 16, 694-703 (1996)] 
indicates the specificity of the split-hybrid system. 

These results lead to interesting suggestions. Various CREB 
mutauons were identified which disrupt CREB-CBP interaction and the 
majonty of disrupting mutations occurred in the CREB PKA phosphorylation 
mouf. This result was consistent with previous observations that 
nonphosphorylated CREB and CBP do not interact [Kwok, et al. Nature 
370:223-226 (1994)]. The most common motif for PKA phosphorylation is 
an RRX(S/T)X amino acid sequence but RX(S/T)X and KRXX(S/T)X are Uso 
phosphorated [Kemp and Pearson, T.I.B.S. 15, 342-346 (1990)] The 
15 argmme residues in the phosphorylation site arc critical for electrostatic 
mteract.ons with acidic amino acid residues in the catalytic subunit of PKA 
[Kmghton, et al.. Science 253, 414-420 (1991)], and consistent with this 
observation, CREB mutants with changes at arginine residues 130 and 131 
were .dentified in the split hybrid assay that did not interact with CBP. 
0 Results also showed that CREB mutations at amino acids 

proline at residue 132 and tyrosine 134 were unable to bind CBP. It is likely 
that the mutations at these residues adversely affect the structure of the 
phosphorylation motif, although these positions are generally thought to be 
less crmcal to CBP binding. It is possible that the substitution of proline at 
5 pos,t,on 132 with threonine created a new phosphorylation site (RXTX) that 
mterfered with the critical phosphorylation of serine at position 133 
Although not generally thought to be part of the "classical" consensus PKA 
phosphorylation motif, hydrophobic amino acids are commonly found 
carboxy-terminal to PKA sites [Kemp, et al, T.I.B.S. 19:440-444 (1994)] 
The importance of these flanking residues may explain the frequent occurrence 
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of disrupting mutations involving tyrosine at position 134. Further studies 
will be directed to determining if mutations of proline at position 134 and 
tyrosine at position 134 directly disrupt phosphorylation of serine at position 
133 or disrupt binding of CREB to CBP by some other mechanism. 

In addition, substitution of serine at position 133 with threonine 
also prevented the interaction of CREB and CBP. PKA protein substrates 
containing a phosphorylatable threonine residue are known to exist in nature 
(i.e., protein phosphatase inhibitor 1 and myelin basic protein), although they 
are less common than those with phosphorylatable serines [Zetterqvist, et aL, 
in Peptides and Protein Phosphorating , (ed.) Kemp, B.E. (CRC Press, Boca 
Raton, FL), pp. 172-187 (1990)], and synthetic peptides containing serine to 
threonine substitutions are relatively poor substrates for PKA phosphorylation 
[Zetterqvist, etal, supra]. In the split-hybrid assay, however, it is unclear 
whether the mutation of threonine at position 133 disrupts the CREB-CBP 
interaction or if the mutant fails to become phosphorylated. Despite previous 
observations that serine residue at position 133 of mammalian CREB can be 
phosphorylated by a variety of protein kinases other than PKA, for example 
calcium/calmodulin-dependent protein kinase H and IV, protein kinase C, and 
a nerve growth factor (NGF)-activated CREB kinase [Sheng, et aL, Neuron 
4:571-582 (1990); Sheng, et al., Science 252:1427-1430 (1991); Xie and 
Rothstein, J. Immunol. 154:1717-1723 (1995); Ginty, et aL, Cell 77:1-20 
(1994)], it is not known which, if any, of these particular protein kinases are 
able to phosphorylate CREB at the serine at position 133 in yeast. The 
requirement for integrity of the entire RRXSX amino acid sequence, however, 
25 suggests that PKA is a reasonable candidate. 

The second category of mutations were identified adjacent the 
PKA phosphorylation motif. Amino acids isoleucine at position 137 and 
leucine at position 138 have previously been suggested to be important for 
hydrophobic interactions of CREB with CBP fParker, etaL.Mol. Cell. Biol. 
16. 694-703 (1996)]. In this study, most of the mutations at position 137 and 
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138 converted these hydrophobic residues to polar amino acids. Thus, another 
possibility for the failure of these mutants to bind to CBP is that changes at 
these positions affect protein folding. Similarly, the mutation at position 141 
substituted a polar residue for the wild-type hydrophobic leucine, and this 
mutation also has the potential to affect protein folding. 

Substitution of the isoleucine at position 137 with a hydrophobic 
phenylalanine residue was found to disrupt the interaction between CREB and 
CBP as well. This result could have been the result of a detrimental effect on 
folding because of the steric hindrance associated with the comparatively 
larger size of phenylalanine. Alternatively, the proposed hydrophobic 
interactions between CREB and CBP are somewhat specific. Structural 
studies will be directed to definitively determine how these mutations affect 
binding. 

Perhaps most surprising was the finding that critical mutations 
were restricted to a small region in the KID sequence, even though the 
relatively low affinity of phosphoryiated CREB and CBP. determined to be 
between 250 and 400 nM by fluorescence anisotropy measurements [Kwok, 
et al.. Nature 370, 223-226 (1994)], is consistent with a restricted protein 
binding domain. The capability of the split-hybrid system to screen for a 
limited number of CREB mutants suggests that the system is highly specific, 
and thus, should be useful to identify mutations which disrupt interacts 
between other pairs of binding proteins. 
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Example 4 
Tax/SRF Binding Interaction 

To further investigate the feasibility of using the split-hybrid 

system to study protein-protein interactions, a pair of well characterized 

interacting proteins, SRF and Tax, was tested. Previous studies indicated that 

SRF and Tax interact in a standard yeast two-hybrid system suggesting that 

the proteins may be utilized in the split hybrid assay. Plasmid pLexA-SRF, 

containing a human SRF cDNA fused to the LexA DNA binding domain, was 

transformed into strain YI584 along with either pVP16-Tax or pVP16 alone. 

As with the P LexA-VP16 transformation, the yeast strains co-expressing 

LexA-SRF and VP16 Tax failed to yield any colonies on medium lacking 

histidine. In contrast, when LexA-SRF was co-transformed with a vector 

encoding the VP16 activation domain alone, yeast growth occurred on medium 

lacking histidine, suggesting that TetR expression was not activated. These 

results demonstrated that a protein-protein interaction in the split-hybrid 

system can effectively prevent yeast growth and further indicated the utility 

of the assay for the study of various protein/protein interactions. 



Example 5 
Casein Kinase Binding Assays 

20 Hn-25 

In another example of use of the split hybrid assay to examine 
protein/protein interactions, Hrr25, a yeast casein kinase isoform, or human 
casein kinase I isoform 5, was employed in the assay with a known binding 
partner protein. 

Previous work using the two hybrid assay had identified three 
genes encoding proteins which interact with the yeast casein kinase isoform 
Hrr25. Proteins encoded by the genes were designated TTH1, TIH2, and 
TIH3. The Hrr25 expression construct which was generated for use in the 
two hybrid assay was used in combination with the individual TEH encoding 
constructs in the split hybrid assay to determine if interaction between the 
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binding partners would decrease growth of assay yeast cells on media lacking 
histidinc. Construction of the Hrr25 expression plasmid and isolation of 
plasmids encoding TEH proteins is discussed below. 

In order to identify genes encoding proteins that interact with 
S. cerevisiae HRR25 CKI protein kinase, a plasmid library encoding fusions 
between the yeast GALA activation domain and S. cerevisiae genomic 
fragments ("prey" components) was screened for interaction with a DNA 
binding domain hybrid that contained the E. coli lexA gene fused to HRR25 
("bait" component). The fusions were constructed in plasmid pBTM116 
which contains the yeast TRP1 gene, a 2„ origin of replication, and a yeast 
ADHI promoter driving expression of the E. coli lexA protein containing a 
DNA binding domain (amino acids 1 to 202). 

Plasmid P BTM116::HRR25 encoding the lexA::HRR25 fusion 
protein was constructed in several steps. The DNA sequence encoding the 
initiating methionine and second amino acid of HRR25 was changed to a Smal 
restriction site by site-directed mutagenesis using a MutaGene mutagenesis kit 
from BioRad (Richmond, California). The DNA sequence of HRR25 is set 
out in SEQ ID NO: 39. The oligonucleotide used for the mutagenesis is set 
forth below, wherein the Smal site is underlined. 

5'-CCTACTCTTAGGC^C^GGTCTrTTTAATGTATCC-3' 

(SEQ ID NO: 37) 

After digestion with Smal, the resulting altered HRR25 gene was ligated into 
plasmid pBTM116 at the Smal site to create the lexA::HRR25 fusion 
construct. 

Interactions between bait and prey fusion proteins were detected 
in yeast reporter strain CTY10-5d (genotype =MATa ade2 trpl-901 leu2- 
3,112 his 3-200 gal4 ga i 80 URA3::lexA op-lacZ.) [Luban, et al r e ll 
75:1067-1078 (1993)] carrying a lexA binding site that directs transcription of 
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lacZ. Strain CTYI0-5d was first transformed with plasmid 
pBTMI16::HRR25 by lithium acetate-mediated transformation [Ito, et ai, 
J. Bacterial. 153: 163-168 (1983)]. The resulting transformants were then 
transformed with a prey yeast genomic library prepared as GAL4 fusions in 
the plasmid pGAD [Chien, et al., Proc. Natl. Acad. Sci (USA) 27:9578-9582 
(1991)] in order to screen the expressed proteins from the library for 
interaction with HRR25. A total of 500,000 double transformants were 
assayed for 0-galactosidase expression by replica plating onto nitrocellulose 
filters, lysing the replicated colonies by quick-freezing the filters in liquid 
nitrogen, and incubating the lyscd colonies with the blue chromogenic 

substrate5-bromo-4-chloro-3-indolyl-/?-D-gaIactoside(X-gaI)./3-galactosidase 
activity was measured using Z buffer (0.06 M Na 2 HP0 4 , 0.04 M NaH 2 P0 4 , 
0.01 M KC1, 0.001 M MgS0 4 , 0.05 M 0-mercaptoethanol) containing X-gal 
at a concentration of 0.002% [Guarente, Meth. Enzymol. 7(37:181-191 (1983)]. 
Reactions were terminated by floating the filters on 1M Na 2 C0 3 and positive 
colonies were identified by their dark blue color. 

Library fusion plasmids (prey constructs) that conferred blue 
color to the reporter strain co-dependent upon the presence of the 
HRR25/DNA binding domain fusion protein partner (bait construct) were 
identified. The sequence adjacent to the fusion site in each library plasmid 
was determined by extending DNA sequence from the GAL4 region. The 
sequencing primer utilized is set forth below. 

5 '-GGAATCACTACAGGGATG-3 ' (SEQ ID NO: 38 ) 

DNA sequence was obtained using a Sequenase version H kit (US 
Biochemicals, Cleveland, Ohio) or by automated DNA sequencing with an 
ABI373A sequencer (Applied Biosystems, Foster City, California). 

Four library clones were identified and the proteins they encoded are 
designated herein as TTH proteins 1 through 4 for Targets Interacting with 
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HRR25-like protein kinase isoforms. The TIH1 portion of the HH1 clone 
insert corresponds to nucleotides 1528 to 2580 of SEQ ID NO: 40; the TIH2 
portion of the TIH2 clone insert corresponds to nucleotides 261 1 to 4053 of 
SEQ ID NO: 41; and the TIH3 portion of the TIH3 clone insert corresponds 
to nucleotides 248 to 696 of SEQ ID NO: 42. Based on DNA sequence 
analysis of the TEH genes, it was determined that TEH1 and TIH3 were novel 
sequences that were not representative of any protein motif present in the 
GenBank database (July 8, 1993). TIH2 sequences were identified in the 
database as similar to a yeast open reading frame having no identified 
function. (GenBank Accession No. Z23261, open reading frame YBL0506) 
When the various HH proteins were used in the split hybrid 
assay in combination with Hrr25, it was observed that Hrr25/TIH3 binding, 
previously determined to be weaker than Hrr25/TIH2 or Hrr25/Tffll 
interactions, produced the lowest level of growth in the transformed yeast 
15 strain. 
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In order to isolate cDNAs which encode proteins that interact 
with CKI5, the two hybrid assay was performed using a LexA-CKIS fusion 
protein as bait. The coding region of CKIS was subcloned into a BamHL site 
of pBTMl 16 and transformed into a yeast strain designated CKI5/L40 (MAT 
ahis3 A200trpl-901 leu2-3 1 12 ade2LYS::(IexAo P ) 4 HIS3 URA3::(lexAop) 8 - 
lcZ GAL 4). CKL5/L40 was subjected to a large scale transformation with a 
cDNA library made from mouse embryos staged at days 9.5 and 10.5. 
Approximately 40 million transformants were obtained. Eighty-eight million 
were plated onto selective media lacking leucine, tryptophan and histidine. 
The ability of yeast transformants to grow in the absence of histidine 
suggested that there was an interaction between CKI5 and some library 
protein. 

In a second screening, interaction of the two proteins was 
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assayed by the ability of the interaction to activate transcription of 0- 
galactosidasc. Colonies that turned blue in the presence of X-gal were 
streaked onto media lacking leucine, tryptophan and histidine, grown up in 
liquid culture and pooled for isolation of total DNA. Isolated DNA was used 
to transform E. coli strain 600 which lacks the ability to grow on media 
lacking leucine. Colonies that grew were used for plasmid preparation and 
three classes of cDNA were identified." One class was closely related to a 
Drosophila transcription factor dCRJEBa. 

When CKI5/CREB interaction was examined in the split hybrid 
assay, cells were shown to grow on media containing histidine, but in the 
absence of histidine, growth was inhibited. Addition of small amounts of 
tetracycline to the cell culture restored the cell's ability to grow, suggesting 
that the interaction between CKI5 and CREBa was very weak. 

Example 6 
AKAP 79 Binding Assays 
Expression Plasmid TTtili^H 

In still another example of use of the split hybrid assay to 
examine protein/protein interactions, an anchoring protein for the cAMP 
dependent protein kinase, AKAP 79, was utilized separately with binding 
partner proteins including the cAMP protein kinase regulatory subunit type I 
(RI), the cAMP dependent protein kinase regulatory subunit type n (RH) or 
calcineurin (CaN). Plasmids used in the assay were constructed as described 
below. 

A 1.3 kb Ncol/Bamm fragment containing the coding region 
of AKAP 79 was isolated from a pETl Id backbone and ligated into plasmid 
pASl. Plasmid pASl is a 2 micron based plasmid with an ADH promoter 
linked to the Gal4 DNA binding subunit [amino acids 1-147 as described in 
Keegan et al. , Science. 231 :699-704 (1986)], followed by a hemagglutin (HA) 
tag, polyclonal site and an ADH terminator. The expressed protein was 
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therefore a fusion between AKAP 79 and the DNA binding domain of Ga!4 
Plasmids encoding RI, RTJ or CaN were isolated from a pACT 
■nunne T cei. library in a standard two hybrid assay using the AKAP 79 
expression consm,c, described above. Ptasmid pACT is a !eu2, 2 micron 
based plasmid containing an ADH promoter and terminator with the Oa.4 
transcription activation domain n [amino acids 768-88! as described in Ma 
and Ptasnne, CeU, 48:847-853 (,987„, foliowed by a mu„ip,e cloning site. 
RI. RD and CaN encoding plasmids were isolated as described below. 

A 500 ml SC-Trp yeast cell culture (ODgQQ = 0.6-0.8) was 
hatvested, washed with 100 m, distilled water, and repe.le.ed. The peHe, was 
brought up in 50 ml LiSORB (,00 mM lithium aceute, 10 mM Tris P H8 , 
mM EDTA pH8, and , M Sorbito!,, transferred to a I liter flaslc and shaken 
a. .20 rpm during an incubation of 30 minutes at 30'C. The cells were 

pelleted, resuspended in 625 u l LiSORR an H k.u 
I, DNA * LiiORB, and held on ice wlule preparing the 

The DNA was prepared for transformation by boiling 400 „ l 10 
mg/ml sahnon sperm DNA for 10 minutes after which 500 „, LiSORB was 
added and the solution allowed to slowly cool to room temperature DNA 
tarn a Mu T cell Ubnary was added (40-50 „g> from a 1 tng,,,,, stock. The 
■ced yeas, cell culture was dispensed into ,0 Eppendorf tubes with !20 „. of 
prepared DNA. The tubes were incubated a. 30'C with shaking at 220 RPM 

After 30 minutes, 900 M I of 40% PEG™ in inn m viT- 

rEU 3350 ,n IM 0 mM Li acetate, 10 mM 

Tns. P H 8, and 1 mM EDTA, p H 8, was mixed with each culture and 
mcubation continued for an additional 30 minutes. The samples were pooled 
and a smal, aliouo, (5 „l> was removed to test for transformation efficiency 
and ptated on SC-l^-Tn, plates. The remainder of the cells were added to 
iOO m, SC-Leu-Trp-His media and g row „ for one hour a. 30'C with shaking 
a. 220 RPMS. Harvested cells were resuspended in 5.5 ml SC-Leu-Trp-Hi! 
containing 50 mM 3AT (3-ami„o triazole) media and 300 A aliouots piated 
on 150 mm SC-Uu-Trp-His also containing 50mM 3AT. CeH were ief. to 
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grow for one week at 30°C. 

After four days, titer plates were counted and 1 . 1x10 s colonies 
were screened. Large scale 0-gal assays were performed on library plates and 
ten positive clones were isolated for single colonies. One of these colonies 
grew substantially larger than the rest, and was termed clone 11.1. Sequence 
from clone 11.1 revealed an open reading frame 487 aa long which was 
correctly fused to the GaI-4 activation domain of pACT. The NIH sequence 
database was searched and the sequence was found to be closely homologous 
to the human calmodulin dependent protein phosphatase, calcineurin. 

Additional screening using pACT Mu T-cell library DNA and 
the pASI AKAP 79 bait strain was performed in order to identify other AKAP 
79 binding proteins by the protocol described above. Results from screening 
approximately 21 1 ,000 colonies gave one positive clone designated pACT 2-1 . 
Sequencing and a subsequent data base search indicated that the clone had 
91 % identity with rat type lor regulatory subunit of protein kinase A (RI). 

The library was rescreened using the same AKAP 79 bait and 
fifteen positives were detected from approximately 520,000 transformants. Of 
these fifteen, eleven were found to be homologous to the rat regulatory 
subunit type I of PKA. Each of these isolates were fused to the 5' 
untranslated region of RI and remained open through the initiating methionine. 

Split Hybrid Analy sis 

In split hybrid analysis of AKAP79 binding interactions, a 
plasmid was first constructed for expression of a LexArAKAP 79 fusion 
protein. An AKAP 79 coding region was excised from pAS AKAP 79 as an 
NcolfBamHl fragment and inserted into pBTMl 16 previously digested with the 
same enzymes. The resulting plasmid was designated pBTMI 16-AKAP79. 

Approximately 50,000 W303 yeast cells (strain YI665, see 
Table 1) in logarithmic growth were rinsed in media lacking histidine, 
suspended in 100 M | to 200 M l of the same media, and plated on agar lacking 
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histidine (to select for absence of protein/protein interaction) and also lacking 
leucine and tryptophan (to select for transforms bearing expression 
constructs encoding AKAP 79 and its binding partner). When RH was 
employed as the AKAP 79 binding partner, 2*4^ tetracycline and 5 mM 
3AT were required to prevent the transformed host from growing under 
conditions where the expressed proteins interacted. 

Once conditions were established under which growth of the 
transformed host was eliminated, various candidate inhibitor compounds were 
separate* added to the agar. It was presumed that if one of the candidate 
compounds was capable of disrupting AKAP 79 interaction with the binding 
partner protein, growth of the transformed host should be detectable in the 
v.c.n.ty of the compound on the agar. In the split hybrid assay wherein 
AKAP 79 and RH binding was examined, 2 M 1 of a 30 mM stock solution of 
ICOS Compound 4273 in DMSO, 2 M l of a 10 mM stock solution of ICOS 
Compound 1062 in DMSO, and 2 ^ DMSO alone (as a negative control) 
were spotted on to the plate which was incubated at 30°C for four to five 
days. For ICOS Compound 4273 a ring of growth was detected. 

In order to determine an IC 50 for an inhibitor identified as 
described above, alternative methods may be used. In one method, the 
inhitaor compound is added to the agar over a range of concentrations 
Ideally, the compound is diluted to the point that host cell growth is 
essentially not detectable. 

In another method, a 96 well plate is used and the compounds 
of mterest are serial* diluted across one row of a 96 well plate, one 
compound per row. Media lacking histidine, tryptophan, and leucine is added 
(presuming that the expression plasmids encoding the binding partners also 
encode trp and leu proteins) along with the appropriately transformed host 
yeast strain. Tetracycline and 3AT are added at concentration previously 
determined to extinguish growth of the transformed host cell After two to 
five days incubation at 30>C, the plate well, are read at approximately 600 
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nm using a plate reader. The concentration of inhibitor half way between zero 
and the lowest concentration that permits growth of the host cell to the level 
observed on media containing histidine is estimated to be IC 50 . 

A modification of this second method is particularly amenable 
for use in a high throughput screen of large numbers of candidate inhibitors. 
For example, rather than attempting to determine the IC 50 for a previously 
.dentified inhibitor, separate candidate inhibitors are added to each well of a 
96 well plate, preferably at more than one concentration, and host cell growth 
determined after several days incubation. Inhibitory activity of compounds 
identified in this manner is confirmed on an agar plate and the IC 50 
determined on 96 well plates, each assay as described above. 

Example 7 

General Application of The Split-Hybrid Screen 
In order to examine general utility of the split hybrid system 
various experiments were conducted with binding proteins known to interact.' 
In addition, a number of control experiments were included in order to 
determine if the effects observed with the known binding partners were in fact 
due to protein/protein interaction. 

A - Yeast Assay Strain rnr C ,n,., irn 

Yeast transformants used in assays indicated below were 
derived from LYS2-deficient strains AMR69 (Mat a hisS fys2 leu2 trpl 
UKA3:LexA::La C Z) and AMR70 (Mat „ his3 fys2 trpl Uu2 
URA3:LexA::I^c2) [Hollenberg, et al., Mol. Cell. Biol. 15, 3813-3822 
(1995); Chien, et al., Proc. Natl. Acad. Sci. (USA) 88:97578-9582 (1991)- 
Fields and Song, Nature 340:245-246 (1989)]. Yeast were grown in YEPD 
or selective minimal medium using standard conditions [Sherman, F et al 
Methods in Yeast Oq gfra, Cold Spring Harbor Lab., Cold Spring Harbor' 
NY (1986): Methods in Enzymology, Vol. 194 Guide to Yeast Genetics and 
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Molecular Biology. Eds. chrjs , ine and Hn](J rf ^ 

and AMR70 strains lacking URA3 were firs, generated hy leaking ce.is on 
symhettc media containing 5 mg, m , 5-fluoro-orotic acid (5FOA) [ Maho ,is in 
EnvnCosy, Vol. ,94 Guide to Yeas, Genetics and Molecuiar Biology Eds 
Chnstine and Fink,. Two ^ deficjen , ^ ^ ^.^ ^ ^ ^ 

fac, ,ha, these s.rai„s were subsequently mated. URA3-defic,en, colonies 
were confirmed by testing for uracil auxotrophy and deletion of the 
URA:LexA,Lac 2 locus was confirmed by an absence of 0-galacosidase 
ac„v„y assayed by standard methods. The mutant strains selected we re 
« designated 69-4 and 70-1 

Targeted integration of pRS306/8xlex Aop/TetR was carried out 
by transforming (Ho.lenberg, MoL Celt . BM , s ( 

•he 69-4 strain with plasmid linearized a. a unique Ncol site. The reporter 
gene construct was constructed using parental plasmid pR S306 which encodes 
URA3 as a selectable marker. Stably integrated plasmid .hereby permitted 
selection on media lacking uraci,. T „e positive untci, prototrophic strains 
were examined by Southern analysis ,„ confirm insertion of the plasmid 
sequences. 

Targeted integration of pRS303/2x.=top-LYS was carried ou, 
by transformation (Hollenberg, „ at., supra] of strain 70-1 with p,asn,id 
Imeanzed a, a unique Hpal site. The resulting , ysi „e prototrophic strains 
were examined by Southern analysis to confirm insertion of the plasmid DNA 

riKimn ^ AMR6 ' **~*" Smin ^ Q) """^ 
pRS303/ 2 x,e,^ L Y S insertion was mated with the AMR70-deriva.ive strain 

(MAT a) containing pRSao^SxUxAop/TetR and mated cells were selected 
on media lacking both lysine and uracil. Single colonies were g ro wn up and 
tested for the ability ,„ grow on medja |ackjng ^ ^ ^ 

was designate YI584. In instances where yeas, strains were transfonned with 
other render gene pair combinations, the strains were uniquely designated. 

Yeast bearing integrated reporter gene constructs were 
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subsequently transformed [Hollenberg, etai, supra] with plasmids encoding 
chimeric binding protein. Plasmids encoding the LexA DNA binding region 
were generally derived from parental plasmid pBTMU6 which also encodes 
TRP1 as a selectable marker. Plasmids encoding the VP16 transactivating 
5 domain were generally derived from parental plasmid pVP16 which also 
encodes LEU2 as a selectable marker. Yeast cells which were successfully 
transformed with the four exogenous plasmids were therefore selected by an 
ability to grow on media lacking lysine, uracil, tryptophan, and leucine. 
Plasmids encoding various binding proteins were transformed into the yeast 
10 assay strain as indicated below. 



B. Liquid Assay 

After three days growth at 30 °C on selection media as 
described above, a pool of colonies from each transformation was collected 
and diluted in 5 ml selective media. The mixture was vortexed and 
1 5 immediately sonicated for ten seconds. Cells in the resulting suspension were 
counted and seeded at 1000 cells/ml in selective media, 2 ml per 15 ml tube. 
Tetracycline. 3AT, and histidine were included as determined appropriate by 
the method described above. Each aliquot of cells was incubated with shaking 
for two days at 30°C and cell density measured at OD^. 

C. Characterization of the Assay 

The utility of the split-hybrid assay was first determined using 
well characterized binding proteins and various controls. 

In an initial study, YI584 cells were transformed with plasmids 
pLexA-VP16 and pLeu. While the expressed proteins from the two plasmids 
25 do not interact, pLexA-VP16 encodes a fusion protein containing the VP16 
activation domain fused directly to LexA which contains a DNA binding 
domain. The chimeric LexA-VP16 protein is a strong transactivator for a 
promoter containing LexA operators. Plasmid pLeu is essentially a blank used 
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as a control co-transformation plasmid. 

express TetR Y T" ' ranSf °™ ed ^ XA - VP ' 6 PlaSmW »™ » 

press TetR protem as indicaled „ y ge| shjft 

absence of h-sudme. Combined, these observations suggested tha, 
overexposed TetR pro*, was ^ „ ^ ^ ^ 

P~g the nn of ^ ^ tramfonned J »«- 

co na ,„, ng „,, idine , ^ indjcating ihai ^ ; - 

have a toxic effect on the assay ceils. 

supported the " r rcS °'' S C °" S ' Stem PrcVi ° US <*~™ - 

supported the earlter suction tha, activation of TetR expression either 

' MPaWe ° f PreVem '"S «V <*H growth on media .acking 
h.s..d,ne, presumably by eliminating HISS production. 

o i*a tt • * Example 8 

Sp.«-Hybnd Assay With Weakly ,„ teracting Binding Proteins 

Protetn/protein interaction was examined in the split-hybrid 
assay to determine utility of the . „ 

in,,™, * " g " WO fusio " P rotei " s known to 

ntenact weakly. ,„ «. 

fragment of a CAMP reguUtory biding protein (CREB283) fused to 
and a foment of the CREB binding protein consiai „ g of ^ CREB 
domain (CBD) fused to VP16. ^ 

a d transformation carried out as previously described. In . firel ass ^ 
P.asm,d s pU^CMB and pVr, 6 -CB D were transformed into ,he « , ^ 
cell growth was observed in the absent „r h ■ . ne ceils and 

„, ,„ , . Mce of h ' SI 'd"ne in the media. Expression 

of -he fuston proteins was confinned by Western blotting. AumfU ,„ 
ecrease eel, gro wth by Utralioo wit „ JAT were ^ ^ » 

concentrate of 3 A T reared ,0 reduce growth in ce„s tra n sfonned ^ 
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pLcxA-CREB and pVP16-CBD also eliminated growth in cells transformed 
with pLexA-CREB and the control plasmid pVP16. 

In light of these results, two alternative approaches were taken 
in order to permit study of binding proteins wherein the interaction is 
relatively weak. Under the assumption that the system was failing at the level 
of TetR transcription, alternative approaches were taken in attempts to amplify 
the TetR effect on expression of HIS3 gene. To achieve this end, assay cells 
were transformed with reporter constructs which encoded multiple let operator 
sequences upstream from the HIS3 gene. In the second approach, the HIS3 
promoter used to drive expression of the TetR gene was replaced with the 
stronger alcohol dehydrogenase (ADH) promoter. 

In YI596 cells wherein the ADH promoter replaced the HIS3 
promoter to drive TetR expression, transformation with plasmids pLexA- 
CREB and pVP16-CBD showed substantially decreased growth on his' media 
15 as compared to that in assay strain YI592 wherein the HIS3 promoter was 
used to drive TetR expression. However, in cells transformed with plasmids 
pLexA-CREB 341 -Ml and pVP16-CBD, no decrease in assay cell growth was 
detected on media lacking histidine. These results indicate that incorporation 
of the ADH promoter to drive TetR expression may be more useful in studies 
20 involving binding proteins that have low affinity. 

When assay strains were utilized which incorporated plasmids 
wherein expression of the HIS3 gene was driven by multiple copies of the let 
operator, transformed cell lines did not grow well enough to indicate potential 
utility in subsequent assays. 
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Example 9 
General Assay Methods 

A - "Fine Tuning" 

In instances where either of the test f usion proIeins ^ 
mtnnsic capacity for ttanscriptiona, activation, TetR win be expressed and 
growth of the assay strain media lacking histidinc will he depressed 
proportion* to the level of TetR expression. ,n order to restore growth of 
■hese cells to approximately the ,evel onsetved on media containing histidine 
<he initially transformed assay yeas, strains are grown i„ ,h e p^nce of 
■ncreasing concentrations of tetracycline which binds to the TetR gene product 
and prevents TetR binding ,„ the ,„ operator. Precise titration of express 
TetR w,,h tetracydine, only to the point that g row .„ of the assay strain is 
restored to the level detected in the presence of histidine, pennits detection of 
subsequent decreased growth of the assay strain following increased TetR 
expression resulting f ron , interactio „ of , he ^ ^ 

empirically determined tetracycline concentration is therefore employed to 
increase "signal-to-noise" rarios under assay conditions. 

After an appropriate tetracycline concentration has been 
determined for each of the candidate assay stntins, the ceUs are transfcmed 
w..h the second plasmid encoding the second fusion binding protein As 
before, growth of each candidate assay sttain is examined on media in the 
presence and absence of histidine. A desirable yeast assay strain is chosen 
whtch shows vigorous growth in the presence of histidine and negligible 
growth on media lacking histidine (indicative of the expected protein/protein 
interact™ and resultant decreased expression of MSJ). 

In instances where binding between the two test proteins is 
company weak. TetR expression may „ ot ^ sufflciem|y ^ 
abohsh HIS3 expression and cells expressing the resultant low levels of H1S3 
wl. still grow on media which lack s histidine. Cells which show this low 
level of viability are grown in the presence of increasing concentrations of 3- 
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aminotriazole (3 AT), a competitive inhibitor in the histidine synthesis 
pathway, in order to reduce cell growth to negligible levels when plated on 
media lacking histidine. As with titration of TetR with tetracycline, addition 
of 3AT to the media is designed to increase the signal-to-noise ratio by 
providing significant changes in growth in the presence and absence of 
histidine in the media. 

In a practical application of the methods for Fine tuning, binding 
between CREB and the CREB binding protein (CBP) is illustrative. Growth 
of the yeast strain YI584 transformed with pLexA-CBD, encoding the CREB 
binding domain (CBD) of CBP, and pVP16-CREB or pLexA-CBD and the 
control piasmid pVP16 was substantially decreased and virtually 
indistinguishable growth rates were detected in both instances on media 
lacking histidine. This observation indicated that the LexA-CBD protein 
product possessed sufficient transactivating capacity to eliminate HIS3 
production. In order to distinguish growth differences between assay cells 
transformed with either pVP16 and pVP16-CREB, increasing amounts of 
tetracycline were added to the media lacking histidine. 

In both transformants, tetracycline was able to relieve growth 
repression in a dose dependent manner, and at increasing concentrations of 
tetracycline, the difference in growth between the two colonies was 
increasingly magnified, with the most distinct growth difference observed 
following addition of tetracycline at 10 /tg/ml. Addition of tetracycline was 
therefore able to overcome the intrinsic transactivating capability of the LexA- 
CBD fusion protein. 

Because the ultimate use of the split-hybrid system is for 
structure-function studies, mutagenesis studies, drug identification and library 
screens, it is important to minimize background growth that might be confused 
with disrupted protein-protein associations. This can be accomplished by the 
addition of 3AT, a competitive inhibitor of the HIS3 gene product. For 
instance, in the presence of 10 M g/ m i of tetracycline, the yeast strain 
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transfonned with pI^xA-CBD and P VP16-CREB still conferred approximately 
1 2 % growth of that observed in the presence of his + media. To diminish this 
background, increasing concentrations of 3AT were added to the media in the 
presence of 10 M g/ml of tetracycline. At the 3AT concentration of 0.25 mM, 
5 the growth of the yeast strain expressing LexA-CBD and VP16-CREB was 
below 5%, while the growth of the control strain was still maintained at 70% 
of control levels. These results indicate that split-hybrid system can be 
.nodulated by 3AT in addition to tetracycline in order to effectively increase 
the signal-to-noise ratio. 

10 B - Preparation of f. vr r^tc 

In order to assess the utility of various plasmids to function in 
the split-hybrid assay, a number of control experiments can be employed 
which lend insight into expression of a desired protein from the transformed 
plasmid. For example, standard immunological methodologies, i.e., 
15 immunoprecipitation, ELISA, etc. , can be used to determine to the extent to 
which a desired protein is expressed. Similarly, a variation of the gel shift 
assay (discussed immediately hereafter) can be used to determine both if a 
protein is expressed and if the expressed protein is capable of DNA binding. 
In each of these control assays, a yeast extract is required which can be 
20 prepared as follows. 

Extracts were prepared as described by Uppaluri and Towle 
[Mot. Cell. Biol. 15:1499-1512 (1995)] and were used for electrophoretic 
mobility shift assays as discussed below. The yeast cells transfonned with 
P LexA-VPI6 were grown in 100 ml of selective synthetic medium lacking 
uracil, tryptophan, and lysine to a density of A^ = 1 . Cells were harvested 
and washed with 5 ml of EB (containing 0.2 M Tris-HCl, pH 8.0, 400 mM 
(NH 4 ) 2 S0 4 , 10 mM MgCl 2 , 1 mM EDTA, 10% glycerol, and 7 mM 0- 
mercaptoethanol). Cells were transferred to microcentrifuge tubes and 
collected by centrifugation. After resuspending in 200 M l EB containing 1 
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mM phenylmethylsulfonyl fluoride (PMSF), l Mg /ml leupeptin, and l^g/ml 
pepstatin, a one-half volume of glass beads was added. The suspension was 
frozen in a -80°C freezer for 1 hour and thawed on ice. Thawed cells were 
vortexed at 4°C for 20 minutes, after which an additional 100 fxl EB was 
added, and cells were left on ice for 30 minutes. The suspension was 
centrifuged for 5 minutes, the supernatant was transferred to a new tube which 
was centrifuged for 1 hour in a microcentrifuge. The supernatant was then 
made to 40% with (NH 4 ) 2 S0 4 and gently rocked for 30 minutes. After a 10 
minute centrifugation, the pellet was resuspended in 300 „l of 10 mM 
HEPES, pH 8.0, 5 mM EDTA. 7 mM 0-mercaptoethanol, 1 mM PMSF, 1 
Atg/ml leupeptin, and 1 M g/ml pepstatin, and 20% glycerol. The resulting 
suspension was dialyzed against the same buffer, and aliquots were stored at - 
80°C. 
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C Electroph oretic mobility shift assay * 

Electrophoretic mobility shift assays were performed as described by 
Shih and Towle [J. Biol. Chem. 267:13222-13228 (1992)]. Double-stranded 
let operator oligonucleotides were prepared by combining equivalent amounts 
of complementary single-stranded DNA (SEQ ID NOS: 7 and 8) in a solution 
containing 50 mM Tris-HCI, P H 8.0, 10 mM MgCI 2 , and 50 mM NaCl 2 , 
heating the mixture to 70°C for 10 minutes, and then cooling to room 
temperature. The annealed oligonucleotides were labeled by filling in 
overhanging 5 ' ends using the Klenow fragment of E. coli DNA polymerase 
I with [or- 32 P]dCTP. Binding reactions were carried out in 20 pi containing 
10 mM Tris-HCI, pH 7.5, 50 mM NaCl, 1 mM EDTA, 1 mM dithiothreitol, 
5 % glycerol, and 2 mg of poly[d(I C)]. A typical reaction contained 20,000 
cpm (0.5-1 ng) of end-labeled DNA with 3-5 „g of yeast extract. Following 
incubation at 22°C for 30 minutes, samples were ' separated on a 4.5% 
nondenaturing polyacrylamide gel containing 50 mM Tris, 384 mM glycine, 
and 2 mM EDTA, pH 8.3. For competition binding experiments, the 
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conditions were exactly as above except that specific and nonspecific 
competitor DNAs were included in the binding mixture before the yeast 
extract was added. The concentration of tetracycline, a competitive inhibitor 
of TetRJiet operator binding, was 1 ?M when utilized. 
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Example 10 

Application of the Split-Hybrid Assay to Identify Agents 
That Prevent Receptor Desensitization and Drug Tachyphylaxis 

Over half of the drugs that are used clinically affect the function 
of seven transmembrane receptors. Although many of the characteristics of 
these receptors are distinct, two general features appear to be conserved. One 
is the ability to signal through dissociation of heterotrimeric G proteins. The 
second is the capacity to lose responsiveness to ligand binding in a process 
termed desensitization which is mediated by receptor phosphorylation and the 
subsequent binding of factors that recognize the phosphorylated state of the 
receptor which prevents continued signaling. Desensitization results in an 
intrinsic limitation to drug action imposed by the action of the drug itself, i.e. , 
activation of a receptor by a hormone or drug initiates mechanisms thai 
prevent subsequent responses to repeated administration of the same agent. 
The coupled mechanisms of activation and deactivation together have been 
termed "homologous desensitization," while the inability of a drug to maintain 
its efficacy is known as "tachyphylaxis." Even though the mechanisms 
underlying homologous desensitization have been worked out in great detail 
over the past few years, there are currently no useful pharmacological 
approaches available that prevent the inactivation mechanism. 

The potential clinical utility of agents that could prevent or 
modulate drug desensitization is enormous. Four examples where therapy is 
limited by the inability of receptors to maintain responsiveness to drugs 
include: (i) asthma wherein desensitization of airway adrenergic receptors 



WO 98/13502 



PCT/US97/17276 



10 



15 



20 



25 



30 



- 59 - 

renders epinephrine treatment ineffective after a period of hours; (ii) 
congestive heart failure wherein desensitization of adrenergic and VIP 
receptors, coupled with an elevation of the 0 adrenergic receptor kinase 
03 ARK), prevents the inotropic effects of endogenous regulatory hormones; 
(iii) Parkinson's disease, wherein dopamine receptor desensitization limits the 
usefulness of agents like L-Dopa; and (iv) chronic pain wherein tolerance 
results from opiate receptor desensitization. Indeed, it is difficult to conceive 
of a pharmacological modality in use today that is not limited in its 
effectiveness by the phenomenon of desensitization. 

The biochemical basis for G protein-coupled receptor desensiti- 
zation involves three classes of proteins including arrestins, kinases and G- 
proteins. ail of which have been cloned rLefkowitz, Nature Biotechnology 
14:283-286 (1996)]. Following activation of a seven transmembrane receptor, 
a region is phosphorylated by one or more G protein-coupled receptor kinases 
(known as GRKs 1-6). For example, in the /3-adrenergic receptor (/JAR) and 
rhodopsin, the cytoplasmic tail is phosphorylated [Premont, et al., J. biol. 
Chem. 269:6832-6841 (1994): Freedman, et al., J. Biol. Chem. 270:17953- 
17961 (1995); Palczewski,*?/ al., J. Biol. Chem. 266:12949-12955 (1991); 
Palczewski, etal., J. Biol. C/iem. 270:15294-15298 (1995)] while in the m2 
muscarinic receptor, the third cytoplasmic loop is phosphorylated [Nakata. et 
al., Eur. J. Biochem. 220:29-36 (1994)]. The best characterized members of 
the family of G protein receptor kinases are the 0AR kinase G8ARK) and 
rhodopsin kinase which are both membrane-associated. While rhodopsin 
kinase contains an intrinsic membrane targeting signal [Inglese, et al., Nature 
359:147-150 (1992)], 0ARK appears to be targeted to the membrane by 
association with G protein 0y subunits [Pitcher, etal. , Science 257:1264-1267 
(1992); Inglese, et al.. Nature 359:147-150 (1992)]. Once the substrate 
receptor for each kinase is activated, presumably by ligand binding, the kinase 
associates and phosphorylates serine and threonine residues on the receptor. 
The phosphorylated receptor then becomes a binding target for one or more 
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other proteins. In the case of &AR, for example, phosphorylation al.ows 
bmdmg of arresting which prevents association with G proteins and promotes 
receptor sequestration and desensitization. Using the 0AR as an exemplary 
desensitization model, it becomes apparent that multiple steps in the pathway 
appear to provide potential points of reg ulation each of which is amenable to 
the spht-hybrid screen to identify molecules that can block the overall 
desensitization pathway. Specifical.y in the case of /3AR, the split hybrid 
system can be used to identify small molecules that: (i) pre vent interaction 
between 0ARK and the G protein 0 subunit; (ii) inhibit PARK activity and 
(in) disrupt the MRKrarresting complex. 

A - Plasm id Ponstnirtionc 

The study of G-protein receptor kinases in the split-hybrid 
system involves three or more recombinant proteins or two or more 
recombtnant proteins and a recombinant peptide library. In the split-hybrid 
system discussed above, two yeast primary expression plasmids are employed- 
pBTMl 16 [Bartel et al., Cellular Interactions in Development: a Practical 
Approach (ed ) Hartley, IRL Press, Oxford, pp. 153-179 (1993)], wfcch 
encodes the I^xA-fusion protein and the TRPI selectable marker, and pVP16 
[Hollenberg Mol. Cell. Biol., 15:3813-3822 (1995)], which encode, the 
VP16-fusion protein and the LEW selectable marker. In order to study 
mteracnons involving more than two recombinant proteins in the split-hybrid 
system, however, additional selectable markers are employed. Construction 
of adduional yeast expression plasmids which are used to examine interactions 
between more than two binding proteins is discussed below 
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1 Plasmid p DRM 

A DNA fragment comprising the ADH promoter and LexA 
sites, the TetR encoding gene, the nuclear localization signal, and the ADH 
terminator sequence are removed from pRS306/4xLexAop/ADH::TetR with 
Sacl. blunt-ended, and digested with Sail. The fragment is isolated and 
ligated into pRS303/2xtetop-LYS2 which has previously been digested with 
Notl, blunt-ended, and digested with Sail. The resulting plasmid, designated 
pDRM, is integrated into the LYS2 locus in the yeast genome as described 
above, and the resulting strain designated YIDRM. Placing the repressor gene 
and selectable marker reporter gene in the LYS2 locus allows ERA3 to be used 
a selectable marker. 



2. Plasmid pRSURAI 

A modified version of the pRS306 vector [Sikorski et aL, 
Genetics, 122:19-27 (1989)] containing the URA3 selectable marker gene is 
also used to encode additional recombinant proteins in the split-hybrid system. 
The plasmid, pRS426, has the 2 micron origin of replication inserted into a 
unique Aatl site of pRS306. Plasmid pRS426 is further modified in the 
following manner: 

(i) The ADH promoter sequence is amplified by PCR from 
BTM116 using primers which incorporate into the amplification product the 
DNA sequence encoding the SV40 large T antigen nuclear localization signal 
(NLS) and an initiating ATG sequence 3' to the ADH promoter. The ADH 
promoter/NLS/ATG sequence is inserted into the poly linker of pRS426. 

(ii) The ADH terminator sequence is amplified by PCR from 
BTM1 16 using primers which incorporate into the product a DNA sequence 
encoding an antibody tag, for example, FLAG, hemagglutinin protein (HA), 
or thioredoxin (Thio) (FLAG, HA, and Thio antibodies are available through 
Santa Cruz Biotechnology, Santa Cruz, CA) and DNA sequences encoding 
stop codons in all three frames to the 5' end of the ADH terminator sequence. 
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The antibody tag/stop codon/ADH terminator sequence is inserted into the 
polylinker of pRS426. 

3- Plasmid pPSArtpo 

PCR is used to engineer unique restriction sites, including for 
example, B g m, Eco47m, Mlul. Mel, and Sphl, immediately adjacent the 5' 
and 3' ends of the URA3 cassette in pRSURA3. The URA3 cassette is 
d.gested from P RSURA3 and replaced with the ADE2 cassette which is 
amplified by PCR. 

4. Plasmid pR TMl Ifi/Arvi 

A fragment containing the ADH promoter, polylinker, and 
ADH termmator is digested from P AD4 [Young etal.. Proc. Nafl. Acad Sci 
(USA), 55,7989-7993 (1989)J with BamHl, blunt-ended and inserted into the 
blunt-ended P VU I site of BTM116 as described [Keegan et al.. Oncogene 
72,1537-1544 (1996)], and the resulting vector designated P BTM116/AD4 
PCR ,s also used to engineer a nuclear localization signal 3' of the ADH 
promoter as described above. This vector contains the TRP1 selectable 
marker and can encode two recombinant proteins: (i) a LexA-fusion protein 
and (i.) a pro tein expressed from the pAD4 region of the vector. 

B gARK and G Protein R Wtndin£ 

In a first application of the split hybrid assay, disruption of 
btnd.ng between the carboxy-tenninal domain of #ARK, containing the 
pleckstnn homology (PH) domain, and the G protein 8 subunit (G0-.) is 
examined. Previous work indicates that the PH domain of 0ARK interacts 
directly with the By subunits of G proteins fPitcher, J.A., et al. Science 
257.1264-1267 (1992) and Touhara, K. et al. , J .Biol. Chem. 269:10217-10220 
(1994)]. Consistent with this observation i s work by Pumiglia, et al 
[Pumiglia. K.M., et al., J.BioLChem. 270:14251-14254 (1995)] which 



3NSOOCID; <WO 9813502A2J_> 



WO 98/13502 



PCT/US97/17276 



- 63 



10 



15 



20 



25 



30 



indicates that G/J 2 interacts with Rafl in yea st and that the interaction is 
disrupted by /J ARK in vitro. 

A DNA fragment containing the carboxy-terminal 222 amino 
acids (residues 467 to 689) of 0ARK1, which includes the PH domain, is 
amplified by PCR from bovine 0ARK1 [Pitcher et al., Science, 257:1264- 
1267 (1992)] and the gel-purified amplification product is inserted into 
pBTMl 16. The resulting plasmid is designated LexA-COOH-/?ARK. A DNA 
fragment containing the entire coding sequence of G0 2 [Fong et al., Proc. 
Nan. Acad. Sci. (USA), 54:3792-3796 (1987)] is PCR amplified from pGEM- 
HZf(-)G/J 2 [Inigez-Lluhi et al. , JBC, 267.23409-23417 (1992)] and the gel- 
purificd amplification product inserted into pVP16. The resulting plasmid is 
designated P VP16-G0 2 . PCR is used in a similar manner to clone the 
carboxy-terminal domain of 0 ASK into pVP16 and G/? 2 into pBTM116. 

0ARK and G0 2 binding is first examined in the two-hybrid 
system to determine if expression of either binding partner as a fusion protein 
in yeast affects protein/protein interaction. Binding of the two proteins is then 
examined in the split hybrid assay in order to determine if protein/protein 
interaction is capable of abolishing growth of the assay yeast strain. As 
above, addition of tetracycline and/or 3-aminotriazole required to maximize 
the difference in growth in the presence and absence of histidine is empirically 
determined. 

Split-hybrid yeast strains containing 0ARK and G/5 2 subunits 
are used to screen libraries of small molecules. Several types of small 
molecule libraries can be examined in the split-hybrid assay, including for 
example, chemical libraries, libraries of products naturally produced by 
microorganisms, animals, plants and/or marine organisms, combinatorial, 
recombinatorial, peptidomimetic, multiparallel synthetic collection, protein, 
peptide and polypeptide libraries. A library of small peptides can be cloned 
into pRSURA3 as described [Yang et al., Nuc. Acids Res., 25.1152-1156 
(1995) and Colas et al. , Nature, 550:548-550)]. Peptides corresponding to the 
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carboxy-terminus of 0ARK or other GRKs which have previous.y been shown 
to block calcium channel ^sensitization in intact neurons, presumably by 
blocking 0ARK and G/J 2 binding and subsequent trafficking of &ARK to the 
cellular membrane [Diverse-Pierluissi, et al. , Neuron 16:579-585 (1 996)] can 
be .dentified in such a screen. Further, it is important to show that the 
molecules identified through the split hybrid selection affect 0ARKG0 
mteract.on as opposed to, for example, tetracycline analogues identified in the 
screen that would not be useful to specifically modulate /JARK/G0, binding 



[) 



B Identification of flAPV inj.;kj mrr 

In a second approach, agents that directly inhibit 0ARK 
funcuon are identified in a modification of the split-hybrid system While 
identification of specific *ARK inhibitors may be difficult, preliminary data 
from spin hybrid assays using CREB/CBP binding partners indicates that the 
system can be used to identify serine kinase inhibitors. The serine kinase 
results also suggest several approaches can be employed in attempts to 
overcome potential problems in identifying 0ARK inhibitors. 

Briefly, binding between the phosphorylated G-protein coupled 
receptor (P-GR) and arresting is examined first in the standard two hybrid 
assay, followed by identification of inhibitors of P-GR/ arreS ting binding m the 
split hybrid assay. For these studies, fragments of three G protein-coupled 
receptors are examined: the carboxy-terminal tail of &M and the third 
cytoplasmic loop of the m2 muscarinic receptor. A DNA foment contamix,- 
the carboxy-terminal tail of the /J 2 AR (amino acids 330 to 413) is PGR 
-phfied rKolbilka „ aL, JSC, 262,7321-7327 (1987)] and the ge, purified 
product inserted into P BTMH6/Ad4 to produce a LexA-^AR fusion gene 
The resulting plasmid is designated pBTM-^AR/AXM. A DNA fragment 
containing the third cytoplasmic loop of the human m2 muscarinic receptor 
(nucleotides 268-324) is amplified from pGEX -I3 m2 fHaga er al TBC, 
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269.12594-12599 (1994)] by PCR and cloned into pBTMl 16/Ad4 creating a 
LexA-m2 fusion gene. The resulting plasmid is designated pBTM-m2/AD4. 
The entire bovine /iARKl coding sequence [Benovic et al., Science, 246:235- 
240 (1989)] is PCR amplified and cloned into the polylinker region originating 
from AD4 in pBTM-/3 2 AR/AD4 and P BTM-m2/AD4. The resulting plasmids 
are designated pBTM-/3 2 AR/ AD4-/3 ARK and pBTM-m2/AD4-/JARK, 
respectively. PCR is used to amplify the DNA fragment containing bovine 
^arresting- 1 (amino acids 1 to 437) [Loose, etal. Science, 245.1547-1550 
(1990)]. This fragment is inserted into pVP16 and is designated pVPI6- 
0arresting-l. PCR is used to amplify the DNA fragment containing rat 
/3arresting-2 (amino acids 1 to 428) [Attramadal, et al.. JBC, 267.17882- 
17890 (1992)] which is inserted intopVP16 to give plasmid pVP16-0arresting- 
2. A PCR strategy is also used to clone arresting into the pBTMl 16/AD4- 
0ARK plasmid and the 0AR and m2 fragments into pVP16. As above, the 
yeast split-hybrid YIDRM strain is transformed with the P-GR-arresting along 
with peptide libraries (cloned into pRSURA3) or grown following 
transformation in the presence of combinatorial drug libraries. 

Inhibitors identified in the split hybrid assay should effect 
disruption of protein/protein interaction either by: (i) inhibiting 0ARK 
phosphorylation of the receptor, thus preventing recognition of the receptor 
by arresting, or (ii) by physical disruption of binding between the receptor and 
arresting. Agents that allow yeast growth for trivial reasons, i.e. , tetracycline 
analogues, can be easily identified through use of simple controls. 

A first potential problem to overcome in this study is that 
cytoplasmic 0ARK enzyme must be targeted to the substrate receptor and, 
once targeted, must phosphorylate the receptor at appropriate sites. In normal 
cells, 0y association serves to target 0ARK to the cell membrane; the 0 
subunit binds to both the 0ARK PH domain and the isoprenylated y subunit 
in association with the membrane. One possible means to encourage the 
necessary specific interactions is to target the binding components in the assay 
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by tagging the proteins with nuclear localization signals, i.e., 0ARK. the 
receptor cytoplasmic tail, and arresting, to the nucleus. The plasmids 
proposed for the study of the P-GR-arresting interaction all contain nuclear 
localization signal sequences adjacent to recombinant gene sequence. 

A second problem is somewhat more difficult to approach. The 
current model is that receptors must be activated by ligand binding before 
being phosphorylated by 0ARK, i.e., targeting of 0ARK via fiy is not 
sufficient for receptor phosphorylation. There are two possible explanations 
for this requirement. The first is that phosphorylation sites on the receptor are 
masked in the absence of ligand and ligand binding causes a conformational 
change which "unmasks" the phosphorylation sites. If this is the case, a 
fragment of the receptor containing the immediate phosphorylation site may 
be used as the 0ARK target. However, although peptides representing 
portions of the 0AR cytoplasmic tail can be phosphorylated by 0ARK, the K m 
for the phosphorylation reaction is poor, suggesting that the kinase may 
require some other part of the receptor for binding and that the unmasking of 
this binding site by agonist is a critical step. 

This problem is addressed in two ways. In the first, the m2 
muscarinic receptor is used in place of the 0AR in view of previous results 
20 which indicate that the m2 protein is a good substrate for (3ARK. The third 
cytoplasmic loop of the m2 receptor serves as both the binding site and 
phosphorylation site for kinase and which should allow use of a LexA7m2 
receptor third cytoplasmic loop fusion gene as one component in the screening 
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system. 



An alternative approach is to artificially mimic the activated 
state of the receptor. Haga, et al. [J. Biol. Chem. 269:12594-12599(1994)] 
have shown that the activity of /J ARK can be stimulated in vitro in the 
presence of mastoporan, a bee venom peptide. Mastoporan is believed to 
mimic the cytoplasmic face of an activated receptor and has been shown to 
increase the affinity of 0ARK for a GST-m2 receptor fusion protein by over 



3NSDOC10: <WO 961 3S02A2J_> 



WO 98/13502 



PCT/US97/17276 



10 



- 67 - 

four orders of magnitude. The same effect can be seen by using peptides 
representing the flanking regions of the m2 third cytoplasmic loop. Thus, 
mastoporan should also activate 0ARK in the two-hybrid yeast strains, allow 
phosphorylation of the receptor fusion protein, and promote interaction with 
arresting. If mastoparan is needed, oligonucleotides containing the coding and 
non-coding nucleotide sequences of the 14-mer peptide (INLKALAALAKKIL- 
NH 2 , SEQ ID NO: 43) are annealed and Iigated into prSADE2. The yeast 
split-hybrid strain YIDRM is transformed with pBTM-/3AR (or m2)/AD4- 
0ARK, pVP16-arresting, pRSADE2-mastoparan, and a pRSURA3-peptide 
library or combinatorial drug library. 



Numerous modifications and variations in the invention as set 
forth in the above illustrative examples are expected to occur to those skilled 
in the art. Consequently only such limitations as appear in the appended 
claims should be placed on the invention. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Hoekstra, Merl F. 

S"rup?L^o? ION >p Meth ° dS t0 ^"ify Compounds For 
disrupting Protem/Protem Interactions 

(iii) NUMBER OF SEQUENCES: 43 
(ivj CORRESPONDENCE ADDRESS * 

HI ™E? SE 630^f ail 4- °' Toole ' Gerstein. Murray * Borun 

(?) ™f Chfcago earS T ° Wer ' 233 S ° Uth Wacker D ^ 

(D) STATE: Illinois 

(E) COUNTRY: United States of America 

(F) ZIP : 60606-6402 

(v) COMPUTER READABLE FORM * 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

C OPERATING SYSTEM: PC-DOS /MS -DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA- 

(A) APPLICATION NUMBER - 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER- 

(B) FILING DATE: 

(viii) ATTORNEY /AGENT INFORMATION * 
(A) NAME : 

IB) REGISTRATION NUMBER - 

(C) REFERENCE /DOCKET NUMBER: 27866/33424 

<ix) TELECOMMUNICATION INFORMATION • 

(A) TELEPHONE: 312/474-6300 

(B) TELEFAX: 312/474-0448 

(C) TELEX: 25-3856 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS * 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

( i i ) MOLECULE TYPE : DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
TTGGTGAGCG CTAGGAGTCA CTGCCAG 

27 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS * 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
TATACTCTAT CAATGATAGA GTAATTCATT ATGTGATAAT GCC 
(2) INFORMATION FOR SEQ ID NO : 3 : 

< i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNES S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
ATTACTCTAT CATTGATAGA GTATATAAAG TAATGTGATT TC 
(2) INFORMATION FOR SEQ ID NO : 4 : 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 ; 
AATTCTG CTA GCCTCTGCAA AGC 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



CGCACGCGTC GAAGAAATCA CATTACTTTA TATA 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS* 

(A) LENGTH: 32 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



34 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
CGCACGCGTA TACTAAAAAA TGAGCAGGCA AG 

32 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS * 
(A) LENGTH; 25 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 7 : 
CGCGTACTCT ATCATTGATA GAGTA 

25 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS - 

(A) LENGTH: 25 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
ATGAGATAGT AACTATCTCA TGCGC 

25 

(2) INFORMATION FOR SEQ ID NO:?: 

(i) SEQUENCE CHARACTERISTICS * 

(A) LENGTH: 4 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
CGCGTACTCT ATCATTGATA GAGTCTAGAC TCTATCAATG ATAGAGTA 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 10: 
GCGACGCGTG CATGCCGTCT TCAAGAATTC CTCGAG 

3 6 



48 
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(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

( i i ) MOLECULE TYPE : DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GCGACGCGTG CATGCCCACC GTACACGCCT ACTCGA 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CATGGCATGC AAAAAAAAAG AGTCATCCGC TAGG 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CATGGCATGC TTAGCGATTG GCATTATCAC AT 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 
TAATACGACT CACTATATAG GG 
<2) INFORMATION FOR SEQ ID NO: 15: 

<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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<ii> MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 
TCTAGACTTT GCCTTCGTTT ATC 

23 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS * 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CGAAGGCAAA GATGTCTAGA TTAGATAAAA G 

31 

(2) INFORMATION FOR SEQ ID NO: 17: 

<i) SEQUENCE CHARACTERISTICS - 

(A) LENGTH: 4 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO:17 : 
CGCGGATCCG CTTT CTCTTC TTTTTTGGAG ACCCACriTC ACATTTAAG 49 
(2) INFORMATION FOR SEQ ID NO:18: 

(i) SEQUENCE CHARACTERISTICS* 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 
AATTGCTCGA GTACTG TATG TACATACAGT AG 

32 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19: 
AATTCTACTG TATGTACATA CAGTACTCGA GC 

32 
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(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE ; DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

CCGGAATTCT CGAGACATAT CCATATCTAA TC 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 7 base pairs 
<B) TYPE; nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
CCGGAATTCA CTAATCGCAT TATCATC 
(2) INFORMATION FOR SEQ ID NO:22: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CATGCCATGG CCATGTCTAG ATTAGATAAA AG 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
GCGAATTCGC CAGGGCAACA GAATG CCACT 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 
CGGGATCCTG GCTGGTTACC CAGGATGCCT TG 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS- 
(A) LENGTH: 3 0 base pairs 
<B) TYPE : nucleic acid 

STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
CGCGGATCCG GATGACCATG GACTCTGGAG 

(2) INFORMATION FOR SEQ ID NO: 26: 
(i) SEQUENCE CHARACTERISTICS • 

St iSS™'- 30 base P ai " 

B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2S : 
CGCGGATCCT TAATCTGACT TGTGGCAGTA 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS - 
(A) LENGTH: 32 base pairs 
B TYPE: nucleic acid 
n STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
CGCGGATCCC CATGACCATG GAATCTGGAG CC 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 31 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



32 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

CGCGGATCCG TGCTGCTTCT TCAGCAGGCT G 
31 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 
ATGGTACCAG CGGCCGCTAG TCGTT T TACA ACGTCGTGAC 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

ATGGTACCGC GGCCG CTTAT TTTTGACACC AGACCAAC 
3 8 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS * 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
CGGAGATCTA AAGAGACTTT TCTCCGGAAC TCAG 



(2) INFORMATION FOR SEQ ID NO:32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 32: 

CGGAGATCTT TACAGGAAGA CTGAACTGT 
29 
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(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS - 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

<ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

CCACCGCGGC AGTGCCAACC CCGATTTAC 
29 

(2) INFORMATION FOR SEQ ID NO:34: 

(i) SEQUENCE CHARACTERISTICS * 
<A) LENGTH: 2 8 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 
CATCCGCGGT GGTGATGGCA GGGGCTGA 



(2) INFORMATION FOR SEQ ID NO: 35: 

U) SEQUENCE CHARACTERISTICS - 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
GGCTATCGAT ACGGCCCCCC CGACCGAT 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS* 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 36: 
GCGTATCGAT CTACCCACCG TACTCGTC 



(2) INFORMATION FOR SEQ ID NO : 37 : 

(i) SEQUENCE CHARACTERISTICS - 
(A) LENGTH: 34 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 

C CTACTCTTA GGCCCGGGTC TTTTTAATGT ATCC 
34 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 8 : 

GGAATCACTA CAGGGATG 
18 

(2) INFORMATION FOR SEQ ID NO:39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 85 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 
ATGGACTTAA GAGTAGGAAG GAAATTTCGT ATTGGCAGGA AGATTGGGAG TGGTTC CTTT 
GGTGACATTT ACCACGGCAC GAACTTAATT AGTGGTGAAG AAGTAGC CAT CAAGCTGGAA 
TCGATCAGGT CCAGACATCC TCAATTGGAC TATGAGTCCC GCGTCTACAG ATACTTAAGC 
GGTGGTGTGG GAATCCCGTT CATCAGATGG TTTGGCAGAG AGGGTGAATA TAATGCTATG 
GTCATCGATC TTCTAGGCCC ATCTTTGGAA GATTTATTCA ACTACTGTCA CAGAAGGTTC 
TCCTTTAAGA CGGTTATCAT GCTGGCTTTG CAAATGTTTT GCCGTATTCA GTATATACAT 
GGAAGGTCGT TCATTCATAG AGATATCAAA C CAGACAACT TTTTAATGGG GGTAGGACG C 
CGTGGTAGCA C CGTTCATGT TATTGATTTC GGTCTATCAA AGAAATACCG AGATTTCAAC 
ACACATCGTC ATATTCCTTA CAGGGAGAAC AAGTCCTTGA CAGGTACAGC TCGTTATGCA 
AGTGTCAATA CGCATCTTGG AATAGAGCAA AGTAGAAGAG ATGACTTAGA ATCACTAGGT 
TATGTCTTGA TCTATTTTTG TAAGGGTTCT TTGCCATGGC AGGGTTTGAA AGCAACCACC 
AAGAAACAAA AGTATGATCG TATCATGGAA AAGAAATTAA ACGTTAGCGT GGAAACTCTA 
TGTTCAGGTT TACCATTAGA GTTTCAAGAA TATATGGCTT ACTGTAAGAA TTTGAAATTC 
GATGAGAAG C CAGATTATTT GTTCTTGGCA AGGCTGTTTA AAGATCTGAG TATTAAACTA 
GAGTATCACA ACGACCACTT GTTCGATTGG ACAATGTTGC GTTACACAAA GGCGATGGTG 
GAGAAGCAAA GGGACCTCCT CATCGAAAAA GGTGATTTGA ACG CAAATAG CAATGCAGCA 
AGTG CAAGTA ACAGCACAGA CAACAAGTCT GAAACTTTCA ACAAGATTAA ACTGTTAG C C 
ATGAAGAAAT TCCCCACCCA TTTCCACTAT TACAAGAATG AAGACAAACA TAATCCTTCA 
CCAGAAGAGA TCAAACAACA AACTATCTTG AATAATAATG CAGCCTCTTC TTTACCAGAG 
GAATTATTGA ACG CACTAGA TAAAGGTATG GAAAACTTGA GACAACAGCA GCCGCAGCAG 
CAGGTCCAAA GTTCGCAGCC ACAACCACAG CCCCAACAGC TACAGCAGCA ACCAAATGGC 
CAAAGACCAA ATTATTATCC TGAACCGTTA CTACAGCAGC AACAAAGAGA TTCTCAGGAG 
CAACAGCAGC AAGTTCCGAT GG CTACAAC C AGGGCTACTC AGTATCCCCC ACAAATAAAC 
AGCAATAATT TTAATACTAA TCAAGCATCT GTACCTCCAC AAATGAGATC TAATCCACAA 
CAGCCGCCTC AAGATAAACC AGCTGGCCAG TCAATTTGGT TGTAA 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS * 

<A> LENGTH: 2625 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 796.. 2580 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 

CATTTTCTTA ATTCTTTTAT GTG CTTTTAC TACTTTGTTT AGTTCAAAAC AATAGTCGTT 60 

ATTCTTAGGT ACTATAGCAT AAGACAAGAA AAGAAAAATA AGGGACAAAT AACATTAGCA 120 

GAAGTACGGT ATATTTTACT GTTACTTATA TACTTTCAAG AAGATGAGTT AAATCGGTAG 180 

CCAGTGTAGA AAAATAATAA TAAGGGTCAT CGATCCTTCG CATTTTATTA TCCAATTAAA 24 0 

GATACGAATC ACGGCAAACT ATATTCAAAG CTCATAGATA ATCGTCGTAA GGCTGACACT 300 

GCAGAAGAAA AGTCATAATT TGAATACTAG CCGGTATGAA ACTGTGATTG ATTAACCTGG 360 

GGTTACCTAA AGAGAACATA AGTAATACTC ATGACAGAAT CAAAACACAA TACAAAATTT 42 0 

ATCCGAACCT CGGCCCGACT GCGGCTCGCC GGGAAAGGGG ACAACCGCTT CTATCCGTCG 480 

ACTAACTTCA TCGGCCCAAT GGAAGCTATG ATATGGGGAT TTCCATTGAG CCGATAGCAA 54 0 

TGTAGGGTAA TACTGTTGCG TATATAGTGA TAGTTATTGA ATTTTATTAC CCTGCGGGAA 600 

TATTGAGACA TCACTAAGCA CGAATTTTAC GTCTGAGGAA AGTTGAATGA TGGCCAAATA 660 

ACCAGGAAAA ACAAATATTG AATCCTTGTG AAGGATTCCA CAGTTGTTTA ATCCTCCTTA 720 

AGCTCACTTA GTATCAATTG TCTAAATAAT ATTGCTTTGA ATCTGAAAAA AATAAAAGTA 780 

CCTTCG CATT AGACA ATG TCA CTG CCG CTA CGA CAC GCA TTG GAG AAC GTT 831 
Met Ser Leu Pro Leu Arg His Ala Leu Glu Asn Val 
1 5 io 

Th^ 1^ v 1 ? n AT n GA 57* GAC TTA ™ GTA CGT TTT ATT ATA 879 

Thr Ser Val Asp Arg lie Leu Glu Asp Leu Leu Val Arg Phe lie He 
*5 20 25 

AAT TGT CCG AAT GAA GAT TTA TCG AGT GTC GAG AGA GAG TTA TTT CAT 927 
Asn Cys Pro Asn Glu Asp Leu Ser Ser Val Glu Arg Glu Leu Phe His 
30 35 40 

5£ ^ 1^ 1°° m TAC ACG GAT ATC AAA TTG ATG AAT 975 

Phe Glu Glu Ala Ser Trp Phe Tyr Thr Asp Phe He Lys Leu Met Asn 

b 50 55 60 

pS Th? ^ o CC l CC T CTA A P" AAA TCA TTT GCT CAA TTG ATC ATA 1023 

Pro Thr Leu Pro Ser Leu Lys He Lys Ser Phe Ala Gin Leu He He 
65 70 75 

AAA CTA TGT CCT CTG GTT TGG AAA TGG GAC ATA AGA GTG GAT GAG GCA 
Lys Leu Cys Pro Leu Val Trp Lys Trp Asp He Arg Val Asp Glu Ala 

80 as 90 

CTC CAG CAA TTC TCC AAG TAT AAG AAA AGT ATA CCG GTG AGG GGC GCT 
Leu Gin Gin Phe Ser Lys Tyr Ly s Lys Ser He Pro Val Arg Gly Ala 
95 100 105 

GCC ATA TTT AAC GAG AAC CTG AGT AAA ATT TTA TTG GTA CAG GGT ACT 
Ala He Phe Asn Glu Asn Leu Ser Lys He Leu Leu Val Gin Gly Thr 



1071 



1119 



1167 
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110 H5 120 

GAA TCG GAT TCT TTG TCA TTC CCA AGG GGG AAG ATA TCT AAA GAT CAA 
Glu Ser Asp Ser Leu Ser Phe Pro Arg Gly Lys ?S JJI Ssp £5 

130 135 140 

AAT GAC ATA GAT TGT TGC ATT AGA GAA GTG AAA GAA GAA ATT GGT TTC 
Asn Asp He Asp Cys Cys He Arg Glu Val Lys G?5 gT5 ?S G^y 
145 150 15 5 

GAT TTG ACG GAC TAT ATT GAC GAC AAC CAA TTC ATT GAA AGA AAT ATT 
Asp Leu Thr Asp Tyr He Asp Asp Asn Gin Phe lie Glu Arg j£n nl 
160 165 170 

CAA GGT AAA AAT TAC AAA ATA TTT TTG ATA TCT GGT GTT TCA GAA GTC 
Gin Gly Lys Asn Tyr Lys He Phe Leu He Ser Gly S£ Se7 Si SE 
175 180 185 

TTC AAT TTT AAA CCT CAA GTT AGA AAT GAA ATT GAT AAG ATA GAA Trr 
Phe Asn Phe Lys Pro Gin Val Arg Asn Glu ill Sp £2 ™ SI J£ 

195 200 

TTC GAT TTT AAG AAA ATT TCT AAA ACA ATG TAC AAA TCA AAT ATC AAG 
Phe Asp Phe Lys Lys lie Ser Lys Thr Met Tyr Lys Ser ™ He JJs 

215 220 
TAT TAT CTG ATT AAT TCC ATG ATG AGA CCC TTA TCA ATG TGG TTA AGP 
Tyr Tyr Leu He Asn Ser Met Met Arg Pro Leu sSJ nit ™ ™ Jrg 
225 230 235 

CAT CAG AGG CAA ATA AAA AAT GAA GAT CAA TTG AAA TCC TAT GCG GAA 
H» Gin Arg Gin lie Lys Asn Glu Asp Gin Leu Lys 2r ™ JK 
240 245 250 

Glu Gin TTG TTG TTG GGT ATC ACT AAG GAG GAG CAG ATT GAT 

Glu Gin Leu Lys Leu Leu Leu Gly He Thr Lys Glu Glu Gin lie Asp 

z " 260 265 

Pro" 2?J ^ S G ^ P" G ^ ATG ™ ^ A CT GCA GTG CAA GCT AAC 
Pro Gly Arg Glu Leu Leu Asn Met Leu His Thr Ala Val Gin Ala Asn 

27 5 280 

S SS iS iE S fS E iiS s as ss s S S Si %Z "» 

290 295 300 

2S o^S SS SJ^SSSiSSSiSS 2S sk ss ^ x »« 

305 310 3 is 

CAG CAG TCA TCG TTT TCT TCT CAA CAA CAA CCT TCA ATA TTT CCA TCT 
Gin Gin ser Ser Phe Ser Ser Gin Gin Gin Pro 2J H* J£ Prt IS 91 
320 325 330 



Ser Si Pro "JhT Sf 1 ^ T *** 0X7 ATA CCA CCT ACT ATG 
Leu ser Glu Pro Phe Ala Asn Asn Lys Asn Val H e Pro Pro Thr Met 

J - 3 ' D 340 345 

Pro ^ JF? ™ ATG TCA CCT CAA TTG TTT GCG ACA ATG 

Pro Met Ala Asn Val Phe Met Ser Asn Pro Gin Leu Phe Ala Thr Met 

J=>U 355 360 

AAT GGC CAG CCT TTT GCA CCT TTC CCA TTT ATG TTA CCA TTA ACT AAC 



1215 



1263 



1311 



1359 



1407 



1455 



1503 



1551 



1599 



1647 



1839 



1887 



1935 
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AAA CCA GAT ACT TCC TTT TTA CCA AAT GAC TCC GTA TCT GGT ATA CAA 
Lys Pro Asp Thr Ser Phe Leu Pro Asn Asp Ser Val Ser Gly lie Gin 
510 515 520 



1983 



2031 



2079 
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Asn Gly Gin Pro Phe Ala Pro Phe Pro Phe Met Leu Pro Leu Thr Asn 
365 370 375 380 

AAT AGT AAT AGC GCT AAC CCT ATT CCA ACT CCG GTC CCC CCT AAT TTT 
Asn Ser Asn Ser Ala Asn Pro He Pro Thr Pro Val Pro Pro Asn Phe 
385 390 395 

AAT GCT CCT CCG AAT CCG ATG GCT TTT GGT GTT CCA AAC ATG CAT AAC 
Asn Ala Pro Pro Asn Pro Met: Ala Phe Gly Val Pro Asn Met His Asn 
400 405 410 

CTT TCT GGA CCA GCA GTA TCT CAA CCG TTT TCC TTG CCT CCT GCT CCT 
Leu Ser Gly Pro Ala Val Ser Gin Pro Phe Ser Leu Pro Pro Ala Pro 
415 420 425 

TTA CCG AGG GAC TCT GGT TAC AGC AGC TCC TCC CCT GGG CAG TTG TTA 
Leu Pro Arg Asp Ser Gly Tyr Ser Ser Ser Ser Pro Gly Gin Leu Leu 
430 435 440 

GAT ATA CTA AAT TCG AAA AAG CCT GAC AGC AAC GTG CAA TCA AGC AAA 2175 
Asp He Leu Asn Ser Lys Lys Pro Asp Ser Asn Val Gin Ser Ser Lys 
445 450 455 460 

AAG CCA AAG CTT AAA ATC TTA CAG AGA GGA ACG GAC TTG AAT TCA CTC 2223 
Lys Pro Lys Leu Lys He Leu Gin Arg Gly Thr Asp Leu Asn Ser Leu 
465 470 475 

AAG CAA AAC AAT AAT GAT GAA ACT GCT CAT TCA AAC TCT CAA GCT TTG 
Lys Gin Asn Asn Asn Asp Glu Thr Ala His Ser Asn Ser Gin Ala Leu 
480 485 490 

CTA GAT TTG TTG AAA AAA CCA ACA TCA TCG CAG AAG ATA CAC GCT TCC 2319 
Leu Asp Leu Leu Lys Lys Pro Thr Ser Ser Gin Lys He His Ala Ser 
495 500 505 



2127 



2271 



2367 



GAT GCA GAA TAT GAA GAT TTC GAG AGT AGT TCA GAT GAA GAG GTG GAG 
Asp Ala Glu Tyr Glu Asp Phe Glu Ser Ser Ser Asp Glu Glu Val Glu 
525 530 535 S40 

*r t GA f T ^ AGA AAT TCA TTG AAT GTA GAT ATT GGG GTG AAC 
Thr Ala Arg Asp Glu Arg Asn Ser Leu Asn Val Asp He Gly Val Asn 
54 5 550 555 

uIT m T ? ^ o GC 5** GAC AGC CGA AGA AGT CAA AAG GAA AAA CCA 

Val Met Pro Ser Glu Lys Asp Ser Arg Arg Ser Gin Lys Glu Lys Pro 
5S0 565 570 

AGG AAC GAC GCA AGC AAA ACA AAC TTG AAC GCT TCT GCA GAA TCT AAT 
Arg Asn Asp Ala Ser Lys Thr Asn Leu Asn Ala Ser Ala Glu Ser Asn 
575 580 585 

SeJ S3 G?S Sy SI G?y TAAATCTTCA CCCTCCGACT TCAGAGTAAC 2610 

590 595 



2415 



2463 



2511 



2559 



ACAGAATCCA CAGTA 



2625 
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(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 6854 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 2050.. 4053 

<Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
AGCTTCTCCC TTTTCCTTCA GTGCTGCTAC TCTCTG CTCT CCACTTAAGT GTTACAATTA 
ATTTGCAGCT AGTTTGCAGT TCGTACAACC TCGCCTATTC TTGTAACGAA GAAGAACGTA 
TTTATAATAT TGGGCTGTAA TGTGTTGAGT TTAGTAATAG ATAAAGTAGG ACAGAGTTCT 
GTCTTTGTTT ATCTATGGGG TTCAGAGTGA TAAGGGGCAG GATAAGGAAG TTAAAAAAAA 
AAAGGTTACG TTATATAACG AAAGAAAAGA AACGAGCGAA GTGCCAACTA TAGCCCAATA 
TCAAGAATGC AAGTCAGCAA AGTACAGTAA TCGTATGAAG ATACG CGATG CGTAATATCC 
CTCAAGGGCT CCGGATCAGA AAAGCTAAGG GAAGATCCTT ACATTACACG GCGTGCGACA 
GACTCGAACC ACAGCTAACT TCTCGTGAAA AGATGGCTTC AACTTCGCTC TTGCAATAAC 
TTTGAAACAC ACGAACAAAG GTTTATTGCG CTTGATTAAC GTTGGAAGTA TATGATACTA 
ATACTACTTT GTTCTCTAAG TCATCGCTAT ATGTTTATCT CGAGGAAAAG GTGCACGGCG 
GTACACAATT ACTTCGCCGT TTCGGGTAAA ACAAGTGTTA CATTTATAAT ATATATGTAT 
ATATGTATGT GCG CGTAAGT ATATGCCGTT CATAACAAAT CATCTTCTTG TTGCTGGATG 
GACTCCTTAA TTTTATTCAA AATGGTAATT TTCCATTTAT CTAGTCTCAT AAAATTGTCA 
AACTCCTTAC AGTGTTCGCT TAGCTGCTCG CTATCACCTT CATTAACAGC ATCGATTAAA 
CTTTTCAAGA AATTTGACTC CCTTGAATCC GCAAAATTCG GATCTTCACT TTGACCCTCT 
TGTAAAGTTC TTGCAGCAGC GACTGCATCA GTAGCAGCTA GCTGACAAAG CCCrrrTITT 
AGGAAGTAAT CCTTCAAACT CCATTGGCTC AATCTATTGC CCATGCTGCT CTTGATCAAC 
TTCGAATATA TATCACTTGC TTCAATATAT TGACCGTCAA GAGCCTTTAG ATCTGCGCAT 
TTGATAAAAC ACTTATTCGA TAATGCTACC GACTGGTCTT GGGCATACCA CTCACCAGCG 
AG CTCATAGC AATCTATAGC TTTTGCATAG TCATGCAAAT CATTTTCTAG AATTTCTCCA 
AGCTCAAACT TGAAATTAGC ACCTCTCCGG AACTGCCCCC TATGAGTAAA AATTTGAATA 
GCATTTTCTA ATGAATCCAC GGCGTTCACA GAGTTTCCAC CGCTTTTAAA GCATTTATAA 
G C CTCTACGT AGGTATTTCC TGCTTCGTCT TCATTACCAG CCTTTTTCTG ATAGTCAGCA 
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GCTTTCAAAA ACGAGTCTCC TGCCAAGTTT AACTCTTTTC TTAGACGGTA AATGGTGGCT 14 4 0 

GCTTGGACAC AAAGATCAGC AGCCTCCTCA AACTTGTATG AATCAGAACC GCTAAACAAT 1500 

TTCATGAAAC CCGATGAAGG AACACCCTTC TTCTCAGCCT TAACACAACG GGAAATATCA 15 60 

ATTCCCGTAT TTCAATGTTA GTAATTTGCC TTCGTAAATT ACGGAATCAC ATAGCTTTCA 1620 

TTTTGTTCCT TTGATATATT TCCCTACTAC ATACTCTTTT CAATAACTCT ACAGGGTCTG 1680 

ACATTTTTAA CTTTCAGGTT AATGATGGTG TTCTTACTAT ATTCTCGAGT CGTACAGAAG 174 0 

TTAGTTCAGA TAAACTG CTT CGGTGCTGCC CACTTCTTAT CATTACTTCA ACTTTACCTT 1800 

CCCTATACCT GTGTGTCCTT ATTAATTCAA GTTAATCCGA GGTAATAGAT TAGGGTAACC 1860 

TTCAATGATG TCACGAAACA CGGATGCTGC AACTTTGCGA TTTTTTCCTG GAAAAGAATA 19 2 0 

ACAATTAAAG GCAGCCTTTC AGCTGAGATT ACCAGCAGGT CTTTGGAGAT TAGCGCAAGA 19 80 

AGAAGTGTGA TATAGTACTC ATAGAGGCAG GCTACAGACT AGGGAAAGCG TGTTCAACAA 204 0 

CAATAAGAA ATG GAG ACC AGT TCT TTT GAG AAT GCT CCT CCT GCA GCC 2088 
Met Glu Thr Ser Ser Phe Glu Asn Ala Pro Pro Ala Ala 
1 5 io 

ATC AAT GAT GCT CAG GAT AAT AAT ATA AAT ACG GAG ACT AAT GAC CAG 2136 
He Asn Asp Ala Gin Asp Asn Asn He Asn Thr Glu Thr Asn Asp Gin 
15 20 25 

GAA ACA AAT CAG CAA TCT ATC GAA ACT AGA GAT GCA ATT GAC AAA GAA 2184 
Glu Thr Asn Gin Gin Ser He Glu Thr Arg Asp Ala He Asp Lys Glu 
30 35 40 45 



AAC GGT GTG CAA ACG GAA ACT GGT GAG AAC TCT GCA AAA AAT GCC GAA 2232 
Asn Gly Val Gin Thr Glu Thr Gly Glu Asn Ser Ala Lys Asn Ala Glu 
50 55 60 

CAA AAC GTT TCT TCT ACA AAT TTG AAT AAT GCC CCC ACC AAT GGT GCT 2280 
Gin Asn Val Ser Ser Thr Asn Leu Asn Asn Ala Pro Thr Asn Gly Ala 
^5 70 75 

TTG GAC GAT GAT GTT ATC CCA AAT GCT ATT GTT ATT AAA AAC ATT CCG 2328 
Leu Asp Asp Asp Val He Pro Asn Ala He Val He Lys Asn He Pro 
80 85 so 

TTT GCT ATT AAA AAA GAG CAA TTG TTA GAC ATT ATT GAA GAA ATG GAT 2376 
Phe Ala He Lys Lys Glu Gin Leu Leu Asp He He Glu Glu Met Asp 
95 100 105 

CTT CCC CTT CCT TAT GCC TTC AAT TAC CAC TTT GAT AAC GGT ATT TTC 
Leu Pro Leu Pro Tyr Ala Phe Asn Tyr His Phe Asp Asn Gly He Phe 
110 H5 120 125 

AGA GGA CTA GCC TTT GCG AAT TTC ACC ACT CCT GAA GAA ACT ACT CAA 
Arg Gly Leu Ala Phe Ala Asn Phe Thr Thr Pro Glu Glu Thr Thr Gin 
130 135 140 

GTG ATA ACT TCT TTG AAT GGA AAG GAA ATC AGC GGG AGG AAA TTG AAA 
Val He Thr Ser Leu Asn Gly Lys Glu He Ser Gly Arg Lys Leu Lys 
14 5 150 155 
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AGT ATT AGC AAT CAG GCC GTT AAC AAT TCT TCC AAC AGC AAT ACT ATT 3336 
Ser lie Ser Asn Gin Ala Val Asn Asn Ser Ser Asn Ser Asn Thr lie 
415 420 425 

AAC AGT AAT AAC GGT AAC GGT AAC AAT GTC ATC ATT AAT AAC AAT AGC 33 84 

Asn Ser Asn Asn Gly Asn Gly Asn Asn Val He He Asn Asn Asn Ser 
430 435 440 445 

GCC AGC TCA ACA CCA AAA ATT TCT TCA CAG GGA CAA TTC TCC ATG CAA 3432 
Ala Ser Ser Thr Pro Lys He Ser Ser Gin Gly Gin Phe Ser Met Gin 
450 455 460 

CCA ACA CTA ACC TCA CCT AAA ATG AAC ATA CAC CAT AGT TCT CAA TAC 34 80 

Pro Thr Leu Thr Ser Pro Lys Met Asn He His His Ser Ser Gin Tyr 
465 470 475 

AAT TCC GCA GAC CAA CCG CAA CAA CCT CAA CCA CAA ACA CAG CAA AAT 
Asn Ser Ala Asp Gin Pro Gin Gin Pro Gin Pro Gin Thr Gin Gin Asn 
480 485 490 

GTT CAG TCA GCT GCG CAA CAA CAA CAA TCT TTT TTA AGA CAA CAA GCT 
Val Gin Ser Ala Ala Gin Gin Gin Gin Ser Phe Leu Arg Gin Gin Ala 
495 500 505 

ACT TTA ACA CCA TCC TCA AGA ATT CCA TCC GGT TAT TCT GCC AAC CAT 
Thr Leu Thr Pro Ser Ser Arg He Pro Ser Gly Tyr Ser Ala Asn His 
510 515 520 525 

TAT CAA ATC AAT TCC GTT AAT CCC TTA CTG AGA AAT TCT CAA ATT TCA 3672 
Tyr Gin He Asn Ser Val Asn Pro Leu Leu Arg Asn Ser Gin He Ser 
530 535 540 

CCT CCA AAT TCA CAA ATC CCA ATC AAC AGC CAA ACC CTA TCC CAA GCG 3720 
Pro Pro Asn Ser Gin He Pro He Asn Ser Gin Thr Leu Ser Gin Ala 
545 550 555 

CAA CCA CCA GCA CAG TCC CAA ACT CAA CAA CGG GTA CCA GTG GCA TAC 
Gin Pro Pro Ala Gin Ser Gin Thr Gin Gin Arg Val Pro Val Ala Tyr 
560 565 570 

CAA AAT GCT TCA TTG TCT TCC CAG CAG TTG TAC AAC CTT AAC GGC CCA 
Gin Asn Ala Ser Leu Ser Ser Gin Gin Leu Tyr Asn Leu Asn Glv Pro 
575 580 585 

TCT TCA GCA AAC TCA CAG TCC CAA CTG CTT CCA CAG CAC ACA AAT GGC 
Ser Ser Ala Asn Ser Gin Ser Gin Leu Leu Pro Gin His Thr Asn Glv 
590 595 600 60S 

TCA GTA CAT TCT AAT TTC TCA TAT CAG TCT TAT CAC GAT GAG TCC ATG 
Ser Val Has Ser Asn Phe Ser Tyr Gin Ser Tyr His Asp Glu Ser Met 
610 615 620 

TTG TCC GCA CAC AAT TTG AAT AGT GCC GAC TTG ATC TAT AAA TCT TTG 
Leu Ser Ala His Asn Leu Asn Ser Ala Asp Leu He Tyr Lys Ser Leu 
"5 630 635 

AGT CAC TCT GGA CTA GAT GAT GGC TTG GAA CAG GGC TTG AAT CGT TCT 
Ser His Ser Gly Leu Asp Asp Gly Leu Glu Gin Gly Leu Asn Arg Ser 
640 645 ^ 650 

TTA AGC GGA CTG GAT TTA CAA AAC CAA AAC AAG AAG AAT CTA TGG 
Leu Ser Gly Leu Asp Leu Gin Asn Gin Asn Lys Lys Asn Leu Trp 
655 660 665 
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TAATATATAC TOCCATOTT CTTATGATTAT AGAGTTTGTT TG GTATTTGT ATATCGCACG 
ATACAAGTAA TGAGGGGTGC TTACACAAGA TAAAAGATAA AAAAATATAT ATATATAATA 
-AACCATCA AAAACACCAT TGAAAAAAAA TATAAAAAAA AAAAAAAATA ACCGAATATG 
AATATGAAAT TAATGATCAT GATGAAGTTA a™ C TG AGAAACGTCA CC.AA.GXCG 
ATGAAACGAT GATAATGAAT GAATGATGAG G CTACTTTAA GTAACGCAAT GTAATCAAG P 
CAAAATTATC CCTCT^r T«rTXCCCr CTTTTGAGAT TTTATTTTTA ACCTACTACT 

MCTTTTm ttttgaacgt TcrrrrcccA catac™ tata TCG tat .tatatgtap 

GATGTTTAAT CACAGAGATG TTTCTACCTT ACTCGATATT CT TO GCAT TAATTGATA~ 
CTTGCTCACT GCA.CA.TGG CGGTATTTGT AGTATATAGA AAGTCGGGTA ACAATAATT" 
ATTGACATTT C^TAC AATGATCAGA GAAGAGCAGA AAGTTTCATA GTCAAACG^ 
CAGGCCAATT GAACAAGAAA TTATTCGTTT TOAG TCGT TGAGTGTTCA AC TC ACA TC P 
TATTTTGGTG GTTCTTGATT AATTGGGGGC TrCA^ GAAATAAAGA G TC GGGAAAA 
TAGCACAGAA ACAAAGCATA TTAAAAGAGG CAAAAGAAGA AAGAACGAAT ATAAAAGGTA 
AAAAAGGAAA AGCATTGCTA rTC^CTC ATAGGTGTTA ^CATACCGC CCTCTCTCT. 
CTrCCX^T CATTAATTAG TCTCCGTATA ATTTGCAGAT AATGTCATTA ACAGCAAACO 
ACGAATCGCC AAAACCCAAA AAAAATGCAT TATTGAAAAA CTTAGAGATC GATGATCTGA 
TACATTCTCA ATTTGTCAGA AGCGATACAA ATGGACATAG AACTACAAGA CGACTATTCA 
ACTCCGATGC CAGTATATCA CATCGAATAA GAGGAAGTGT TCGGTCTGAT AAAGGCCTTA 
ATAAAATAAA AAAAGGGTTG A^CCCAGC AGTCCAAACT TGCGTCAGAA AA.TCTTCTC 
AAAATATCGT TAATAGGGAC AATAAGATGG GAGCAGTAAG .TTCCCCATT ATTGAACCT? 
ATATTGAAGT CAGCGAGGAG TTGAAGGTTA GAATTAAGTA TGATTCTATC AAKTTTTTCf 
ATTTTGAAAG ACTAATATCT AAATCTTCAG TCATAGCACC TTTAGTTAAC AAAAATATa/ 
CATCATCCGG TCCTCTAATC GGGTTTCAAA GAAGAGTTAA CAGGTTAAAG CAAACA^ 
ATCTAGCAAC CGAAAACATG GAGTACCCAT ATTCTTCTGA TAATACGCCA TTCAGGGAW 
ACGATTCTTG GCAATGGTAC GTACCATACG GCGGAACAAT AAAAAAAATG AAAGA^ 
GTACAAAAAG AAC^ACCC ACCTGGGAAG ATAAAATAAA G^ACA TTTTTAGAAf' 
ACTCTAAGTC TCC AACGTAC ATTAATGGTA ACGTATCACT TTGCAATCAT AATGAAACCG ' 
ATCAAGAAAA CGAAGATAGG AAAAAAAGGA AAGGGAAAGT ACCAAGAATC AAAAATAAAG 

TGTGGTTTTC CCAGATAGAA TACATTGTTC TTCGAAATTA TC^^ 

^■uAAATTA TGAAATTAAA CCTTGGTATA 
CATCTCCTTT TCCGGAACAC ATCAACCAAA ATAAAATGGT rm,., 

a-kmaatggt TTTTATATGT GAGTTCTGCC 
TAAAATATAT GACTTCTCGA TATACTTTTT ATAGACACCA ACTAlirw 

^ A< - At - t - A ACTAAAGTGT CTAACTTTTA 
AGCCCCCCGG AAATGAAATT TATCGCGACG GTAAGCTGTC TGTTtv 

WAAGCTGTC TGTTTGGGAA ATTGATGGGC 
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ACC AGT TTC AGC ACC AGA ACG AGG TAC ATG TTC CGA TGG GAT GAC CAG 
Thr Ser Phe Ser Thr Arg Thr Arg Tyr Met Phe Arg Trp Asp Asp Gin 
5 0 55 60 
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GGGAGAATGT CTTGTATTGT CAAAATCTTT GCCTGTTGGC AAAATGTTTT ATCAATTCTA 
AGACTTTGTA TTACGATGTT GAACCGTTTA TATTCTATAT TCTAACGGAG AGAGAGGATA 
CAGAGAACCA TCCCTATCAA AACGCAGCCA AATTCCATTT CGTAGGCTAT TTCTCCAAGG 6153 
AAAAATTCAA CTCCAATGAC TATAACCTAA GTTGTATTTT AACTCTACCC ATATACCAGA 6213 
GGAAAGGATA TGGTCAGTTT TTGATGGAAT TTTCATATTT ATTATCCAGA AAGGAGTCAA 62 73 

AATTTGGAAC TCCTGAAAAA CCATTGTCGG ATTTAGGATT ATTGACTTAC AGAACGTTTT 63 3 3 

GGAAGATAAA ATGTGCTGAA GTGCTATTAA AATTAAGAGA CAGTGCTAGA CGTCGATCAA 639 3 

ATAATAAAAA TGAAGATACT TTTCAG CAGG TTAGC CTAAA CGATATCGCT AAACTAACAG 64 53 

GAATGATACC AACAGACGTT GTGTTTGGAT TGGAACAACT TCAAGTTTTG TATCG CCATA 6513 
AAACACGCTC ATTATCCAGT TTGGATGATT TCAACTATAT TATTAAAATC GATTCTTGGA 657 3 

ACAGGATTGA AAATATTTAC AAAACTTGGA GCTCAAAAAA CTATCCTCGC GTCAAATATG 6 633 

ACAAACTATT GTGGGAACCT ATTATATTAG GGCCGTCATT TGGTATAAAT GGGATGATGA 
ACTTAGAACC CACCGCATTA GCGGACGAAG CTCTTACAAA TGAAACTATG GCTCCGGTAA 
TTTCGAATAA CACACATATA GAAAACTATA ACAACAGTAG AGCACATAAT AAACGCAGAA 
GAAGAAGAAG AAGAAGTAGT GAGCACAAAA CATCCAAGCT T 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2814 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
<B) LOCATION: 1..696 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

GAA TTC CAA TAC ACC AAA CAG CTG CAT TTC CCT GTG GGG CCC AAA TCC 48 
Glu Phe Gin Tyr Thr Lys Gin Leu His Phe Pro Val Gly Pro Lys Ser 
1 5 10 15 

ACA AAC TGT GAG GTA GCG GAA ATT CTT TTA CAC TGC GAC TGG GAA AGG 
Thr Asn Cys Glu Val Ala Glu He Leu Leu His Cys Asp Trp Glu Arq 
20 25 30 

TAC ATA AAT GTT TTA AGT ATA ACA AGA ACA CCA AAT GTT CCT AGT GGT 144 
Tyr He Asn Val Leu Ser He Thr Arg Thr Pro Asn Val Pro Ser Gly 
35 40 45 
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GGG CAA GGT TGC ATA TTA AAA ATA AGT TTT TGG GTG GAC TGG AAC GCA 
Gly Gin Gly Cys He Leu Lys He Ser Phe Trp Val Asp Trp Asn Ala 
65 70 75 80 

TCC AGT TGG ATC AAG CCA ATG GTA GAG AGC AAT TGT AAA AAT GGA CAA 
Ser Ser Trp He Lys Pro Met Val Glu Ser Asn Cys Lys Asn Gly Gin 
85 90 95 

ATT AGC GCC ACT AAG GAC TTG GTA AAG TTA GTC GAA GAA TTT GTA GAG 
He Ser Ala Thr Lys Asp Leu Val Lys Leu Val Glu Glu Phe Val Glu 
100 105 no 

AAA TAC GTG GAA TTG AGC AAA GAA AAA GCA GAT ACA CTC AAG CCG TTG 
Lys Tyr Val Glu Leu Ser Lys Glu Lys Ala Asp Thr Leu Lys Pro Leu 
115 120 125 

CCC AGT GTT ACA TCT TTT GGA TCA CCT AGG AAA GTG GCA GCA CCG GAG 
Pro Ser Val Thr Ser Phe Gly Ser Pro Arg Lys Val Ala Ala Pro Glu 

± - su 135 140 

CTG TCG ATG GTA CAG CCG GAG TCG AAA CCA GAA GCT GAG GCG GAA ATC 
Leu Ser Met Val Gin Pro Glu Ser Lys Pro Glu Ala Glu Ala Glu lie 
145 150 155 160 

TCA GAA ATA GGC AGC GAC AGA TGG AGG TTT AAC TGG GTG AAC ATA ATA 
Ser Glu He Gly Ser Asp Arg Trp Arg Phe Asn Trp Val Asn lie lie 
lfi 5 170 175 

ATC TTG GTG CTC TTG GTG TTA AAT CTG CTG TAT TTA ATG AAG TTG AAC 
He Leu Val Leu Leu Val Leu Asn Leu Leu Tyr Leu Met Lys Leu Asn 
180 185 190 

AAG AAG ATG GAT AAG CTG ACG AAC CTC ATG ACC CAC AAG GAC GAA GTT 
Lys Lys Met Asp Lys Leu Thr Asn Leu Met Thr His Lys Asp Glu Val 
195 200 205 

GTA GCG CAC GCG ACT CTA TTG GAC ATA CCA GCC CAA GTA CAA TGG TCA 
Val Ala His Ala Thr Leu Leu Asp He Pro Ala Gin Val Gin Trp Ser 
210 215 220 

AGA CCA AGA AGG GGA GAC GTG TTG TAACAGAGTA ATCATGTAAT ATTGTATGTA 
Arg Pro Arg Arg Gly Asp Val Leu biA 
225 230 

AGGTTATGTA TGTTCGTATG GTATGGAAAA AAAAAAAAAA AAAGGATGCT ATGTGGAGAA 
TGTAAGGCGT GGTAGCTCCG GATAATTCAG TCTGTAGGCT TCATCACGGG CAGTGGCCTG 
ACTCTGAGAG CTTGCTCCGG TATTAAGTTG TGCGTTTGAA ATTTTCTGGA AAAAAGAAAT 
TGATTGGTTG AAGCTATACT CGTCGAAAGA TTTCTTCGGC AGTGGTTGTT GCTCCACCTG 
CACGGGAGTT GTGTTTGCGT TTATGTTCGG CTTGG CTATA TTATTAGCGA GTGATGTTTG 
CAATTTGCTG TATTGAGAAT CAATTTGGGT GCGTAAGCTT TCAATAATTT TGCAGACCGC 
AGGCACTTCC AACTTTATGA GTTGCAGGTA TTCTCTTTTA TGAATATACG ATGACGACGA 
TGACGACGAC GCATCCATGC GCAAAAGCTC AGGGTGTCTA GATAGTTTGT TAGTCAATAA 
ATCCACATAT CTAAAATAAT AAATAAACGA CAGCGACAAG TCGTTGGCCT GGAACGCACA 
CTGTGCCTTT TCCAATATGC CGATGCATGT TTTCAGGTAA ATTCTCAATG GTATCG CCGG 
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ATTGAAGCGA TAATCCTTAG CGTCCTGAAC CAATTGCTTA CTAGACTTCA TGACCTACCG 13 86 

GGGCCAGATA AAGATGCGGA AGGAAGAGAA AAAATGTATA GTGGTTGGTG AACCGCAACA 14 4 6 

ATAATTCGTG CCAACACTTT AATCGAAGCA AAAATTGTCT TGTATGTTAT TAATATTATC 1506 

TATCTAACCA TTGATTTACG TATAAAACTG TCGATGCTCA TCGCCTAGCA ATGAAAAAAT 1566 

TTTTTCTTTT TTTTTTCATT ATTTCTCTTT GTTGCGTACT TTTTTTCATT GCGTTTCGCG 1626 

GCAAAAGCGA TTCGAGTTGA CTGGAAGTGT GTTATACTAT AAAAAGTGTA TATGCCTATT 168 6 

TTTGGTTCTG ATCTTTACTT TACTGTTAAG TACTGGCTGA GGCAGTAGAC TCTGCCTCTG 174 6 

TTACGG CAGC GGTATTCGCC TCGGCATCAG CAGCCGCCCA CGGTAGAGTA GGTTCTGTTG 18 06 

TTTTGACGTT TGCCAAGGTA CTGTCCAAAT GCTCCTTCAG CAAGGCCTCA TTACTTTCCT 1866 

TCTCCGGACC CACCGATTGC GTGATCTCCT GTACACGGTT CAAGAACTTG TTCAAATTGT 192 6 

AGCCCGCAGC AGCATCAGAG ACTTCTTGTG TGTAAGGGAC ACCCCTCAAC TCCTTGACTC 19 8 6 

TTCTTTTGTG CACTTTGCCC TTTAAATGCG TTTTTAACGC TATAGCAGTC TCCATGTATT 204 6 

TGGCACAGTG TATG CAATAG TGCTGAC CAA GGCCCGGTTT GGTTTCATCC AATGGCTGGT 2106 

TCAGAAG CTT CTGTACTGAT TCCTTGGTGG ACAAATCGTT ATAGATCAGG TCCAAGTCTC 2166 

GTGTTCTTCT TTTAGTCTTG TATCTCTTGA CCGAATATCT ACCCATGATG CGCTATTGTT 2226 

TTATCTTCAC TTGTCTGTGT GTTTAACTGC CTTTCAATTC ACCTCATCTC ATCTCCCGCT 2286 

ACTTTCCATA T AT AAAAG CA AAATTAATTT GCTTTTTCCC CTGTCAGTAT AAAAAAATTT 234 6 

TCCGCAGGAT ATAGAAAAAA AAGAAATGAA ATTATAGTAG CGGTTATTTC CGTGGGGTGC 24 06 

TTTTTTACAC CTGTACATCT TTTCCCTCCG TACATTTTTT TTATTTTTTT TTTGGGTTTT 24 66 

TTTTTTTCGA TATTTTTCCC TCCGAAACTA GTTAGCACAA TAATGCTGAC TAAGGAAACT 2526 

TTTCATCTCA GAATTGATGG TCAGTTTGGT TTCTCTAGAG AAT AG TTT AT AAAAAGATGT 2586 

TGATGTGGAG CAACCATTTA TACATC CTTT CCGCAAGTGC TTTTGGAGTG GGACTTTCAA 2 646 

ACTTTAAAGT ACAGTATATC AAATAACTAA TTCAAGATGG CTAGAAGACC AGCTAGATGT 2706 

TACAGATACC AAAAGAACAA GCCTTACCCA AAGTCTAGAT ACAACAGAGC TGTTCCAGAC 2766 

TCCAAGATCA GAATCTACGA TTTGGGTAAG AAGAAGGCTA CCGTCGAT 2814 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 14: 

He Asn Leu Lys Ala Leu Ala Ala Leu Ala Lys Lys He Leu 
1 5 10 
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WHAT IS CLAIMED IS: 

1- A host cell transformed or transfected with DNA 

comprising: 

a repressor gene encoding a repressor protein, said 
repressor gene under transcriptional control of a promoter; 

a selectable marker gene encoding a selectable marker 
protem; said selectable marker gene under transcriptional 
control of an operator; said operator regulated by interaction 
with said repressor protein; 

a first recombinant fusion protein gene encoding a first 
binding protein or binding fragment thereof in frame with 
either a DNA binding domain of a transcriptional activating 
protem or a transactivating domain of a transcriptional 
activating protein; and 

a second recombinant fusion protein gene encoding a 
second binding protein or binding fragment thereof in frame 
with either a DNA binding domain of a transcriptional 
activating protein or a transactivating domain of a 
transcriptional activating protein, whichever domain is not 
encoded by the first fusion protein gene, said second binding 
protein or binding fragment thereof capable of interacting with 
said first binding protein or binding fragment thereof such that 
interaction of said second binding protein or binding fragment 
thereof and said first binding protein or binding fragment 
thereof brings into proximity a DNA binding domain and a 
transactivating domain forming a functional transcriptional 
activating protein; said functional transcriptional activating 
protein acting on said promoter to increase expression of said 
repressor gene. 
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2. The host cell of claims 1 wherein said DNA binding 
domain and said transactivating domain are derived from a common 
transcriptional activating protein. 

3. The host cell of claim I wherein one or more of the 
repressor gene, the selectable marker gene, the first recombinant fusion 
protein gene, and the second recombinant fusion protein gene arc encoded on 
distinct DNA expression constructs. 

4. The host cell of claim 1 wherein said selectable marker 
protein is an enzyme in a pathway for synthesis of a nutritional requirement 
for said host cell such that expression of said selectable marker protein is 
required for growth of said host cell on media lacking said nutritional 
requirement. 

5. The host cell of claim 1 wherein said host cell is a yeast cell 
or a mammalian. 

6. The host cell of claim 2 wherein said selectable marker gene 

encodes HIS 3; 

7. The host ceil of claim 2 wherein said repressor protein gene 
encodes a tetracycline resistance protein; 

8. The host cell of claim 2 wherein said operator is a tet 

operator. 

9. The host cell of claim 2 wherein said promoter is selected 
from the group consisting of the LexA promoter, the alcohol dehydrogenase 
promoter, the Gal4 promoter. 
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10. The host cell of claim 2 wherein said DNA binding domain 
denved from a protein selected from the group consisting of LexA and Gal4. 

11. The host cells of claim 2 wherein said transactivating 
domam is derived from a protein selected from the group consisting of VP16 
and Gal4. 

12. The host cell of claim 2 wherein the first binding protein 
is CREB and the second binding protein is CBP. 

13- The host cell of claim 2 wherein the first binding protein 
is Tax and the second binding protein is SRF. 

14. The host cell of claim 2 wherein the first binding protein 
is casein kinase I and the second binding protein is CREB. 

15. The host cell of claim 2 wherein the first binding protein 
1S AKAP 79 and the second binding protein is selected from the group 
consisting of RI, Rfl and calcineurin. 

16. A method to identify an inhibitor of binding between a first 
bmding protein or binding fragment thereof and a second binding protein or 
binding fragment thereof comprising the steps of: 

a) growing host cells of any one of claims 1 through 15 in 
the absence of a test compound and under conditions 
which permit expression of said first binding protein or 
binding fragment thereof and said second binding 
protein or binding fragment thereof such that said first 
binding protein or fragment thereof and second binding 
protein or binding fragment thereof interact bringing 
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into proximity said DNA binding domain and said 
transactivating domain forming said functional 
transcriptional activating protein; said transcriptional 
activating protein acting on said promoter to increase 
expression of said repressor protein; said repressor 
protein interacting with said operator such that said 
selectable marker protein is not expressed; 

b) confirming lack of expression of said selectable marker 
protein in said host cell; 

c) growing said host cells in the presence of a test 
compound; and 

d) comparing expression of said selectable marker protein 
in the presence and absence of said test compound 
wherein increased expression of said selectable marker 
protein is indicative that the test compound is an 
inhibitor of binding between said first binding protein or 
binding fragment thereof and said second binding 
protein or binding fragment thereof. 

17. The method of claim 16 wherein 
the host cell is a yeast cell; 
the selectable marker gene encodes HIS3; 
transcription of the selectable marker gene is regulated 
by the tet operator; 

the repressor protein gene encodes the tetracycline 
resistance protein; 

transcription of the tetracycline resistance protein is 
regulated by the LexA promoter; 
the DNA binding domain is derived from LexA; and 
the transactivating domain is derived from VP] 6. 
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18. The method of claim 16 wherein 
the host cell is a yeast cell; 
the selectable marker gene encodes HIS3; 
transcription of the selectable marker gene is regulated 
by the tet operator; 

the repressor protein gene encodes the tetracycline 
resistance protein; 

transcription of the tetracycline resistance protein is 
regulated by the alcohol dehydrogenase promoter; 
the DNA binding domain is derived from Lex A; and 
the transactivating domain is derived from VP16. 

19. A kit to identify an inhibitor of binding between a first 
binding protein or binding fragment thereof and a second binding protein or 
binding fragment thereof, said inhibitor identified by the method of claim 16 
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1. Claims: 1-12, 16-19 

A host cell transformed or transfected with DNA comprising: 
a repressor gene encoding a repressor protein, said 
repressor gene under transcriptional control of a promoter; 
a selectable maker gene encoding a selectable protein; said 
marker gene under transcriptional control of an operator; 
said operator regulated by interaction with said repressor 
protein; a first recombinant fusion protein gene encoding a 
first binding potein or binding fragment thereof in frame 
with either a transactivator domain of a transcriptional 
activator protein; and a second recombinant fusion protein 
or binding fragment thereof in frame with either a DNA 
binding domain of a transcriptional activating protein, 
whichever domain is not encoded by the first fusion protein 
gene, said second binding protein or binding fragment 
thereof capable of interacting with said first binding 
protein or binding fragment thereof such that interaction of 
said second binding protein or binding fragment thereof 
brings into proximity a DNA binding domain and a 
transactivating domain forming a functional transcriptional 
activating protein; said functional transcriptional 
activating protein acting on said promoter to increase 
expression of said repressor gene; said DNA binding domain 
and said transactivating domain are derived from a common 
transcriptional activating protein; one or more of the 
repressor gene, the selectable marker gene, and the first 
and second recombinant fusion protein genes, are encoded on 
distinct DNA expression constructs; said host cell wherein 
said selectable marker protein is an enzyme; said host cell 
is a yeast cell or a maimialian; said selectable marker gene 
encodes HIS3, said repressor protein gene encodes a 
tetracyline resistant protein; said operator is a tet 
operator; said promoter is selcted from the group consisting 
of the LexA-, the alcohol dehydrogenase-, the GA14-promoter; 
said DNA binding domain derived from a protein from the 
group consisting of LexA and Gal4; said transactivating 
domain is derived from a protein selected from the group 
consisting of VP16 and Gal4; said first binding protein is 
CREB and the second binding protein is CBP; a method and kit 
to identify an inhibitor of binding between a first and a 
second binding protein using said host cell* 



2. Claim : 13 

The host cell of subject one but wherein the first binding 
protein is Tax and the second binding protein is SRF. 
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3. Claim : 14 



The host cell of subject one but wherein the first binding 
protein is casein kinase I and the second binding protein is 
CREB. 



4. Claim : 15 

The host cell of subject one but wherein the first binding 
protein is AKAP 79 and the second binding protein is 
selected from the group consisting RI, RII and calcineurin. 
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